Plant diacylglycerol O-acyltransferase and uses thereof

ABSTRACT

Plant nucleic acid compositions encoding polypeptide products with diacylglyceride acyltransferase (DGAT) activity, as well as the polypeptide products encoded thereby and methods for producing the same, are provided. Methods and compositions for modulating DGAT activity in a plant, particularly DGAT activity in plant seeds, and transgenic plants with altered DGAT activity are provided. Such plants and seeds are useful in the production of human food and animal feedstuff, and have several other industrial applications. Also provided are methods for making triglycerides and triglyceride compositions, as well as the compositions produced by these methods. The subject methods and compositions find use in a variety of different applications, including research, medicine, agriculture and industry.

CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application is a continuation-in-part of application Ser. No. 10/040,315 filed Oct. 29, 2001; which application is: (a) a continuation-in-part of application Ser. No. 09/339,472 filed on Jun. 23, 1999, which application claims priority to the filing date of United States Provisional Patent Application Serial No. 60/107,771 filed Nov. 9, 1998; and (b) a continuation-in-part of PCT application serial no. PCT/US98/17883, filed Aug. 28, 1998, which application is a continuation in part of application Ser. No. 09/103,754, now U.S. Pat. No. 6,344,548, filed Jun. 24, 1998; the disclosures of which applications are herein incorporated by reference.

ACKNOWLEDGMENT OF GOVERNMENT SUPPORT

[0002] This invention was made with Government support under Grant Nos. R01 52069 and R01 57170 awarded by the National Institute of health and support from the Veterans Administration. The Government has certain rights in this invention.

INTRODUCTION

[0003] 1. Field of the Invention

[0004] The field of the invention is plant enzymes, particularly plant acyltransferases.

[0005] 2. Background of the Invention

[0006] Triacylglycerol is synthesized by the sequential transfer of acyl chains to a glycerol backbone by a series of enzymes in the Kennedy pathway. These enzymes are glycerol-3-phosphate acyltransferase (which adds a first acyl chain to a glycerol backbone to form a glycerol-3-phosphate), lysophosphatidic acid acyltransferase (which adds a second acyl chain to glycerol-3-phosphate to form diacylglycerol) and diacylglycerol acyltransferase (DGAT), which transfers a third acyl chain to diacylglycerol to form triacylglycerol (TAG) (see Topfer el al. Science 1995 268: 681-686).

[0007] Because of its high carbon content, TAG is an important molecule for storage of energy in plants, particularly in seeds where it is used as an energy source for germination. Because of its unique properties, TAG is a valuable component of animal feedstuffs and a major component of the purified vegetable oils consumed by man. In addition, many TAGs have other uses, including as a “bio-fuel”, in industrial lubricants, surfactants, paints and varnishes, and in various soaps and cosmetics. The ability to manipulate the biosynthesis of TAG using DGAT is a major goal of plant biotechnology.

[0008] Because the reaction catalyzed by DGAT is unique to the Kennedy pathway, the reaction is thought to be at a critical branchpoint in glycerolipid biosynthesis. Enzymes at such branchpoints are considered prime candidates for sites of metabolic engineering, and, as such, DGAT has become an enzyme of intense interest because of its potential as a regulator of TAG biosynthesis. Furthermore, several biochemical studies indicate that TAG enzymes from different plant species are specific for certain acyl chains, suggesting that the plant DGAT could be used to manipulate the levels of saturated fatty acids in an oil or acyl chain length in TAG, in addition to the absolute levels of TAG biosynthesis.

[0009] Because of its central role in the regulation of TAG biosynthesis in plants, there is much interest in the identification of plant DGAT polypeptides and polynucleotides that encode them. Previous attempts to identify plant DGAT polypeptides have, in general, failed, primarily because the DGAT enzyme is difficult to purify. As such, a great need still exists for plant DGAT proteins and polynucleotides that encode them. The present invention addresses these, and other, needs.

[0010] Relevant Literature

[0011] References of interest include: U.S. Pat. No. 6,100,077; Cases et al., the FASEB Journal (Mar. 20, 1998) Vol. 12., No.(5):A814; Cases et al., Proc. Natl. Acad. Sci. USA (October 1998) 95:13018-13023; Garver et al., Analytical Biochemistry (1992) 207: 335-340; Shockey et al., Plant Phys. (1995) 107: 155-160; Little et al Biochem. J. (1994) 304: 951-958; Kamisaki at al., J. Biochem (1994) 116:1295-1301; Katavic et al., Plant Phys. (1995) 108:399-409 Zou et al., Plant Mol. Bio. (1996) 31:429-433; Kamisaki et al J. Biochem (1996) 119: 520-523; Oelkers et al J. Bio. Chem. (1988) 41: 26765-26771; Kamisaki et al., J. Biochem (1997) 121: 1107-1114; Lassner et al., The Plant Cell (1996) 8: 281-292) and Genbank Accession No.AF078752 (Nov. 11, 1998); Genbank Accession No. AAC63997 (Oct. 15, 1998); and Genbank Accession No. AF059202 (Oct. 15, 1998).

SUMMARY OF THE INVENTION

[0012] Plant nucleic acid compositions encoding polypeptide products with diacylglyceride acyltransferase (DGAT) activity, as well as the polypeptide products encoded thereby and methods for producing the same, are provided. Methods and compositions for modulating DGAT activity in a plant, particularly DGAT activity in plant seeds, and transgenic plants with altered DGAT activity are provided. Such plants and seeds are useful in the production of human food and animal feedstuff, and have several other industrial applications. Also provided are methods for making triglycerides and triglyceride compositions, as well as the compositions produced by these methods. The subject methods and compositions find use in a variety of different applications, including research, medicine, agriculture and industry.

BRIEF DESCRIPTION OF THE FIGURES

[0013]FIG. 1 shows Arabidopsis thaliana Genbank accession number AA042298 (SEQ ID NO:1)

[0014]FIG. 2 shows Arabidopsis thaliana Genbank accession number ATH238008 (SEQ ID NO:2)

[0015]FIG. 3 shows Arabidopsis thaliana Genbank accession number CAB45373 (SEQ ID NO:3)

[0016]FIG. 4 shows Brassica napus Genbank accession number AF251794 (SEQ ID NO:4)

[0017]FIG. 5 shows Brassica napus Genbank accession number AAF64065 (SEQ ID NO:5)

[0018]FIG. 6 shows Brassica napus Genbank accession number AF155224 (SEQ ID NO:6)

[0019]FIG. 7 shows Brassica napus Genbank accession number AAD40881 (SEQ ID NO:7)

[0020]FIG. 8 shows Brassica napus Genbank accession number AF164434 (SEQ ID NO:8)

[0021]FIG. 9 shows Brassica napus Genbank accession number AAD45536 (SEQ ID NO:9)

[0022]FIG. 10 shows Tropaeolum majus Genbank accession number AY084052 (SEQ ID NO:10)

[0023]FIG. 11 shows Tropaeolum majus Genbank accession number AAM03340 (SEQ ID NO:11)

[0024]FIG. 12 shows Nicotiana tabacum Genbank accession number AF1 29003 (SEQ ID NO:12)

[0025]FIG. 13 shows Nicotiana tabacum Genbank accession number AAF19345 (SEQ ID NO:13)

[0026]FIG. 14 shows Perilla frutescens Genbank accession number AF298815 (SEQ ID NO:14)

[0027]FIG. 15 shows Perilla frutescens Genbank accession number AAG23696 (SEQ ID NO:15)

[0028]FIG. 16 shows Zea mays Genbank accession number AY110660 (SEQ ID NO:16)

[0029]FIG. 17 shows Zea mays Genbank accession number PCO148220 (SEQ ID NO: 17)

DETAILED DESCRIPTION OF THE INVENTION

[0030] Plant nucleic acid compositions encoding polypeptide products with diacylglyceride acyltransferase (DGAT) activity, as well as the polypeptide products encoded thereby and methods for producing the same, are provided. Methods and compositions for modulating DGAT activity in a plant, particularly DGAT activity in plant seeds, and transgenic plants with altered DGAT activity are provided. Such plants and seeds are useful in the production of human food and animal feedstuff and have several other industrial applications. Also provided are methods for making triglycerides and triglyceride compositions, as well as the compositions produced by these methods. The subject methods and compositions find use in a variety of different applications, including research, medicine, agriculture and industry.

[0031] Before the subject invention is described further, it is to be understood that the invention is not limited to the particular embodiments of the invention described below, as variations of the particular embodiments may be made and still fall within the scope of the appended claims. It is also to be understood that the terminology employed is for the purpose of describing particular embodiments, and is not intended to be limiting. Instead, the scope of the present invention will be established by the appended claims.

[0032] In this specification and the appended claims, the singular forms “a,” “an” and “the” include plural reference unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this invention belongs.

[0033] Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range, and any other stated or intervening value in that stated range, is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges, and are also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention.

[0034] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this invention belongs. Although any methods, devices and materials similar or equivalent to those described herein can be used in the practice or testing of the invention, the preferred methods, devices and materials are now described.

[0035] All publications mentioned herein are incorporated herein by reference for the purpose of describing and disclosing the subject components of the invention that are described in the publications, which components might be used in connection with the presently described invention.

[0036] Plant DGAT Nucleic Acid Compositions

[0037] Plant nucleic acid compositions encoding polypeptide products, as well as fragments thereof, having diglyceride acetyltransferase (DGAT) activity are provided. By plant nucleic acid composition is meant a composition comprising a sequence of DNA having an open reading frame that encodes a plant DGAT polypeptide, i.e. a gene from a plant encoding a polypeptide having DGAT activity, and is capable, under appropriate conditions, of being expressed as a DGAT polypeptide. Also encompassed in this term are plant nucleic acids that are homologous or substantially similar or identical to the nucleic acids encoding DGAT polypeptides or proteins. Thus, the subject invention provides genes encoding plant DGAT, such as genes encoding monocot DGAT and homologs thereof and dicot DGAT and homologs thereof, as well as Arabidopsis, canola and corn DGATs and homologs thereof. In other words, both monocot and dicot genes encoding DGAT proteins are provided by the subject invention.

[0038] The source of plant DGAT-encoding nucleic acids may be any plant species, including both monocot and dicots, and in particular cells from agriculturally important plant species, including, but not limited to: crops such as soybean, wheat, corn, potato, cotton, rice, oilseed rape (including canola), sunflower, alfalfa, sugarcane, castor and turf; or fruits and vegetables, such as banana, blackberry, blueberry, strawberry, and raspberry, cantaloupe, carrot, cauliflower, coffee, cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, pineapple, spinach, squash, sweet corn, tobacco, tomato, watermelon, rosaceous fruits (such as apple, peach, pear, cherry and plum) and vegetable brassicas (such as broccoli, cabbage, cauliflower, brussel sprouts and kohlrabi). Other crops, fruits and vegetables whose cells may be targetted include barley, rye, millet, sorghum, currant, avocado, citrus fruits such as oranges, lemons, grapefruit and tangerines, artichoke, cherries, nuts such as the walnut and peanut, endive, leek, roots, such as arrowroot, beet, cassava, turnip, radish, yarn, and sweet potato, Arabidopsis, beans, mint and other labiates. Woody species, such as pine, poplar, yew, rubber, palm, eucalyptus etc. and lower plants such as mosses, ferns, and algae are also sources. A source of DGAT genes may also be selected from various plant families including Brassicaceae, Compositae, Euphorbiaceae, Leguminosae, Linaceae, Malvaceae, Umbilliferae and Graminae. Of particular interest are DGAT genes from oilseed crops, such as canola, soybean and corn, and DGAT genes from jajoba, coconut and meadowfoam, Limnanthes alba.

[0039] In certain embodiments, the coding sequence of the Arabidopsis thaliana DGAT gene is of interest, i.e. the A. thaliana cDNA encoding the A. thaliana DGAT enzyme, includes or comprises a nucleic acid sequence substantially the same as or identical to that identified as SEQ ID NO:1 infra. In certain other embodiments, the nucleic acid sequence encoding the full length DGAT enzyme from A. thaliana (SEQ ID NO:2), and the nucleic acid sequences encoding DGAT enzymes from Brassica napus (canola; SEQ ID NOS :4, 6 and 8), Tropaeolum majus (nasturtium; SEQ ID NO:10), Perilla frutescens (perilla; SEQ ID NO:12), Nicotiana tabacum (tabacco; SEQ ID NO:14) and Zea mays (corn; SEQ ID NO:16 and SEQ ID NO:17) are of interest.

[0040] Between plant species, e.g., Arabidopsis and tomato, or corn and rice, homologs have substantial sequence similarity, e.g. at least 75% sequence identity, usually at least 90%, more usually at least 95% between nucleotide sequences. Sequence similarity is calculated based on a reference sequence, which may be a subset of a larger sequence, such as a conserved motif, coding region, flanking region, etc. A reference sequence will usually be at least about 18 nt long, more usually at least about 30 nt long, and may extend to the complete sequence that is being compared. Algorithms for sequence analysis are known in the art, such as BLAST, described in Altschul et al. (1990), J. Mol. Biol. 215:403-10. Unless specified otherwise, all sequence identity values provided herein are determined using GCG (Genetics Computer Group, Wisconsin Package, Standard Settings, gap creation penalty 3.0, gap extension penalty 0.1). The sequences provided herein are essential for recognizing DGAT-related and homologous plant polynucleotides in database searches. A P-value cut-off, as determined by the output of a BLAST search, may also be used to determine whether a sequence in a database is a homolog. When a P-value cut-off is utilized, usually a homologs are identified with a P-value cut-off of 10⁻¹⁰ and below, 10⁻²⁰ and below, 10⁻³⁰ and below or 10⁻⁵⁰ and below.

[0041] Nucleic acids encoding the plant DGAT proteins and DGAT polypeptides of the subject invention may be cDNAs or genomic DNAs, as well as fragments thereof. The term “DGAT-gene” shall be intended to mean the open reading frame encoding specific plant DGAT proteins and polypeptides, and DGAT introns, as well as adjacent 5′ and 3′ non-coding nucleotide sequences involved in the regulation of expression, up to about 20 kb beyond the coding region, but possibly further in either direction. The gene may be introduced into an appropriate vector for extrachromosomal maintenance or for integration into a host genome.

[0042] The term “cDNA” as used herein is intended to include all plant nucleic acids that share the arrangement of sequence elements found in native mature mRNA species, where sequence elements are exons and 5′ and 3′ non-coding regions. Normally mRNA species have contiguous exons, with the intervening introns, when present, being removed by nuclear RNA splicing, to create a continuous open reading frame encoding a DGAT protein.

[0043] A genomic sequence of interest comprises the nucleic acid present between the initiation codon and the stop codon, as defined in the listed sequences, including all of the introns that are normally present in a native chromosome. It may further include the 5′ and 3′ untranslated regions found in the mature mRNA. It may further include specific transcriptional and translational regulatory sequences, such as promoters, enhancers, etc., including about 1 kb, but possibly more, of flanking genomic DNA at either the 5′ and 3′ end of the transcribed region. The genomic DNA may be isolated as a fragment of 100 kbp or smaller; and substantially free of flanking chromosomal sequence. The genomic DNA flanking the coding region, either 3′ or 5′, or internal regulatory sequences as sometimes found in introns, contains sequences required for proper tissue and stage specific expression.

[0044] The nucleic acid compositions of the subject invention may encode all or a part of the subject plant DGAT proteins and polypeptides, described in greater detail infra. Double or single stranded fragments may be obtained from the DNA sequence by chemically synthesizing oligonucleotides in accordance with conventional methods, by restriction enzyme digestion, by PCR amplification, etc. For the most part, DNA fragments will be of at least 15 nt, usually at least 18 nt or 25 nt, and may be at least about 50 nt.

[0045] The plant DGAT-genes of the subject invention are isolated and obtained in substantial purity, generally as other than an intact chromosome. Usually, the DNA will be obtained substantially free of other nucleic acid sequences that do not include a DGAT sequence or fragment thereof, generally being at least about 50%, usually at least about 90% pure and are typically “recombinant”, i.e. flanked by one or more nucleotides with which it is not normally associated on a naturally occurring chromosome.

[0046] Also provided are nucleic acids that hybridize to the above described nucleic acids under stringent conditions, i.e. high or low stringency conditions. An example of high stringency hybridization conditions is hybridization overnight at about 50° C. in 0.1×SSC (15 mM sodium chloride/1.5 mM sodium citrate) followed by two 30 minute washes in 0.1×SSC at about 50° C. An example of low stringency conditions is hybridization overnight at about 50° C. in 2×SSC followed by two 30 minute washes in 2×SSC at about 50° C. Other stringent hybridization conditions are known in the art and may also be employed to identify nucleic acids of this particular embodiment of the invention. For example, other high stringency hybridization conditions include overnight incubation at temperatures other than 50° C., or overnight incubation at 42° C. in a solution containing 50% formamide, 5×SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5×Denhardt's solution, 10% dextran sulfate, and 20 μg/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1×SSC at about 65° C. Usually the nucleic acids that hybridize to the above described nucleic acids are derived from plants, particularly libraries of plant polynucleotides from monocots or dicots, and particularly libraries of the species listed above.

[0047] Also provided are nucleic acids that encode the proteins encoded by the above described nucleic acids, but differ in sequence from the above described nucleic acids due to the degeneracy of the genetic code.

[0048] In addition to the plurality of uses described in greater detail in following sections, the subject nucleic acid compositions find use in the preparation of all or a portion of the plant DGAT polypeptides, as described below.

[0049] Plant DGAT Polypeptide Compositions

[0050] Also provided by the subject invention are plant polypeptides having DGAT activity, i.e. capable of catalyzing the acylation of diacylglycerol. In certain embodiments and in addition to being capable of catalyzing the esterification of diacylglycerol with a fatty acyl CoA substrates, the subject proteins are incapable of esterifying, at least to any substantial extent, the following substrates: cholesterol, 25-hydroxy-, 27-hydroxy-,7α-hydroxy- or 7β-hydroxycholesterols, 7-ketocholesterol, vitamins D2 and D3, vitamin E, dehydrepiandrosterone, retinol, ethanol, sitosterol, lanosterol and ergosterol.

[0051] The term “polypeptide” composition as used herein refers to both the full length proteins as well as portions or fragments thereof. Also included in this term are variations of the naturally occurring proteins, where such variations are homologous or substantially similar to the naturally occurring protein, as described in greater detail below, be the naturally occurring protein the Arabidopsis protein, canola protein, or corn protein of from some other species which naturally expresses a DGAT enzyme, be that species monoct or dicot. In the following description of the subject invention, the term DGAT is used to refer not only to the Arabidopsis form of the enzyme, but also to homologs thereof expressed in non-Arabidopsis species, including other dicot species such as canola and monocot species such as corn.

[0052] The subject plant DGAT proteins are, in their natural environment, trans-membrane proteins. The subject proteins are characterized by the presence of at least one potential N-linked glycosylation site, at least one potential tyrosine phosphorylation site, and multiple hydrophobic domains, including 6 to 12 hydrophobic domains capable of serving as trans-membrane regions. The proteins range in length from about 300 to 650, usually from about 450 to 550 and more usually from about 500 to 550 amino acid residues, and the projected molecular weight of the subject proteins based solely on the number of amino acid residues in the protein ranges from about 40 to 80, usually from about 45 to 75 and more usually from about 50 to 65 kDa, where the actual molecular weight may vary depending on the amount of glycolsylation of the protein and the apparent molecular weight may be considerably less (e.g. 40 to 50 kDa) because of SDS binding on gels.

[0053] The amino acid sequences of the subject proteins are characterized by having at least some homology to a corresponding ACAT protein from the same species, e.g. an Arabidopsis DGAT protein has at least some sequence homology with the ACAT protein, the corn DGAT protein has at least some sequence homology with the mouse ACAT-1 corn, etc., where the sequence homology will not exceed about 50%, and usually will not exceed about 40% and more usually will not exceed about 25%, but will be at least about 15% and more usually at least about 20%, as determined using GCG (Genetics Computer Group, Wisconsin Package, Standard Settings, Gap Creation Penalty 3.0, Gap Extension Penalty 0.1).

[0054] Of particular interest in many embodiments are plant DGAT proteins that are non-naturally glycosylated. By non-naturally glycosylated is meant that the protein has a glycosylation pattern, if present, which is not the same as the glycosylation pattern found in the corresponding naturally occurring protein. For example, human DGAT of the subject invention and of this particular embodiment is characterized by having a glycosylation pattern, if it is glycosylated at all, that differs from that of naturally occurring human DGAT. Thus, the non-naturally glycosylated DGAT proteins of this embodiment include non-glycosylated DGAT proteins, i.e. proteins having no covalently bound glycosyl groups.

[0055] In addition to the specific DGAT proteins described above, homologs or proteins (or fragments thereof) from other species, i.e. monocot and dicot plant species, are also provided, where such homologs or proteins may be from a variety of different types of species, including both monocot and dicots and in particular from agriculturally important plant species, including, but not limited to, crops such as soybean, wheat, corn, potato, cotton, rice, oilseed rape (including canola), sunflower, alfalfa, sugarcane, castor and turf; or fruits and vegetables, such as banana, blackberry, blueberry, strawberry, and raspberry, cantaloupe, carrot, cauliflower, coffee, cucumber, eggplant, grapes, honeydew, lettuce, mango, melon, onion, papaya, peas, peppers, pineapple, spinach, squash, sweet corn, tobacco, tomato, watermelon, rosaceous fruits (such as apple, peach, pear, cherry and plum) and vegetable brassicas (such as broccoli, cabbage, cauliflower, brussel sprouts and kohlrabi). Other crops, fruits and vegetables whose cells may be targetted include barley, rye, millet, sorghum, currant, avocado, citrus fruits such as oranges, lemons, grapefruit and tangerines, artichoke, cherries, nuts such as the walnut and peanut, endive, leek, roots, such as arrowroot, beet, cassava, turnip, radish, yarn, and sweet potato, Arabidopsis, beans, mint and other labiates. Woody species, such as pine, poplar, yew, rubber, palm, eucalyptus etc. and lower plants such as mosses, ferns, and algae are also sources. A source of homologous plant genes may also be selected from various plant families including Brassicaceae, Compositae, Euphorbiaceae, Leguminosae, Linaceae, Malvaceae, Umbilliferae and Graminae. Of particular interest are homologous genes from oilseed crops, such as canola, soybean and corn. By homolog is meant a protein having at least about 35%, usually at least about 40% and more usually at least about 60% amino acid sequence identity the specific DGAT proteins as identified in SEQ ID NOS: 04 to 06, where sequence identity is determined using GCG, supra.

[0056] Of particular interest in certain embodiments is the Arabidopsis DGAT protein, where the Arabidopsis DGAT protein of the subject invention has an amino acid sequence that comprises or includes a region substantially the same as or identical to the sequence appearing as SEQ ID NO:3 infra. As such, DGAT proteins having an amino acid sequence that is substantially the same as or identical to the sequence of SEQ ID NO:3 are of interest. By substantially the same as is meant a protein having a region with a sequence that has at least about 75%, usually at least about 90% and more usually at least about 98% sequence identity with the sequence of SED ID NO:3, as measured by GCG, supra. Of particular interest in other embodiments are the DGAT proteins from canola (SEQ ID NOS:5-9), nasurtium (SEQ ID NO:11), perilla (SEQ ID NO:13), tobacco (SEQ ID NO:15) and corn (encoded by nucleotides comprising SEQ ID NO:16 or SEQ ID NO:17). Also of particular interest in yet other embodiments of the subject invention is the A. thaliana DGAT protein (SEQ ID NO:3), where the A. thaliana DGAT protein of the subject invention has an amino acid sequence encoded by the nucleic acid comprising the sequence appearing as SEQ ID NO:1, infra.

[0057] The plant DGAT proteins of the subject invention (e.g. Arabidopsis DGAT or a homolog thereof, non-Arabidopsis DGAT proteins, e.g. canola DGAT, corn DGAT etc) are present in a non-naturally occurring environment, e.g. are separated from their naturally occurring environment. In certain embodiments, the subject DGAT is present in a composition that is enriched for DGAT as compared to DGAT in its naturally occurring environment. As such, purified DGAT is provided, where by purified is meant that DGAT is present in a composition that is substantially free of non DGAT proteins, where by substantially free is meant that less than 90%, usually less than 60% and more usually less than 50% of the composition is made up of non-DGAT proteins. For compositions that are enriched for DGAT proteins, such compositions will exhibit a DGAT activity of at least about 100, usually at least about 200 and more usually at least about 1000 pmol triglycerides formed/mg protein/min, where such activity is determined by the assay described in the Experimental Section, of U.S. patent application Ser. No. 10/040,315.

[0058] In certain embodiments of interest, the plant DGAT protein is present in a composition that is substantially free of the constituents that are present in its naturally occurring environment. For example, a plant DGAT protein comprising composition according to the subject invention in this embodiment will be substantially, if not completely, free of those other biological constituents, such as proteins, carbohydrates, lipids, etc., with which it is present in its natural environment. As such, protein compositions of these embodiments will necessarily differ from those that are prepared by purifying the protein from a naturally occurring source, where at least trace amounts of the proteins constituents will still be present in the composition prepared from the naturally occurring source.

[0059] The plant DGAT of the subject invention may also be present as an isolate, by which is meant that the DGAT is substantially free of both non-DGAT proteins and other naturally occurring biologic molecules, such as oligosaccharides, polynucleotides and fragments thereof, and the like, where substantially free in this instance means that less than 70%, usually less than 60% and more usually less than 50% of the composition containing the isolated DGAT is a non-DGAT naturally occurring biological molecule. In certain embodiments, the DGAT is present in substantially pure form, where by substantially pure form is meant at least 95%, usually at least 97% and more usually at least 99% pure.

[0060] In addition to the naturally occurring plant DGAT proteins, DGAT polypeptides which vary from the naturally occurring DGAT proteins are also provided. By DGAT polypeptides is meant proteins having an amino acid sequence encoded by an open reading frame (ORF) of a plant DGAT gene, described supra, including the full length DGAT protein and fragments thereof, particularly biologically active fragments and/or fragments corresponding to functional domains; and including fusions of the subject polypeptides to other proteins or parts thereof. Fragments of interest will typically be at least about 10 aa in length, usually at least about 50 aa in length, and may be as long as 300 aa in length or longer, but will usually not exceed about 1000 aa in length, where the fragment will have a stretch of amino acids that is identical to a DGAT protein of SEQ ID NOS: 3, 5, 7, 9, 11 or 13 or a homolog thereof, of at least about 10 aa, and usually at least about 15 aa, and in many embodiments at least about 50 aa in length.

[0061] Preparation of Plant DGAT Polypeptides

[0062] The subject plant DGAT proteins and polypeptides may be obtained from naturally occurring sources, but are preferably synthetically produced. Where obtained from naturally occurring sources, the source chosen will generally depend on the species from which the DGAT is to be derived.

[0063] The subject DGAT polypeptide compositions may be synthetically derived by expressing a recombinant gene encoding DGAT, such as the polynucleotide compositions described above, in a suitable host. For expression, an expression cassette may be employed. The expression vector will provide a transcriptional and translational initiation region, which may be inducible or constitutive, where the coding region is operably linked under the transcriptional control of the transcriptional initiation region, and a transcriptional and translational termination region. These control regions may be native to a DGAT gene, or may be derived from exogenous sources.

[0064] Expression vectors generally have convenient restriction sites located near the promoter sequence to provide for the insertion of nucleic acid sequences encoding heterologous proteins. A selectable marker operative in the expression host may be present. Expression vectors may be used for the production of fusion proteins, where the exogenous fusion peptide provides additional functionality, i.e. increased protein synthesis, stability, reactivity with defined antisera, an enzyme marker, e.g. β-galactosidase, etc.

[0065] Expression cassettes may be prepared comprising a transcription initiation region, the gene or fragment thereof, and a transcriptional termination region. Of particular interest is the use of sequences that allow for the expression of functional epitopes or domains, usually at least about 8 amino acids in length, more usually at least about 15 amino acids in length, to about 25 amino acids, and up to the complete open reading frame of the gene. After introduction of the DNA, the cells containing the construct may be selected by means of a selectable marker, the cells expanded and then used for expression.

[0066] DGAT proteins and polypeptides may be expressed in prokaryotes or eukaryotes in accordance with conventional ways, depending upon the purpose for expression. For large scale production of the protein, a unicellular organism, such as E. coli, B. subtilis, S. cerevisiae, the oeogenic filamentous fungus Mortierella ramanniana, insect cells in combination with baculovirus vectors, or cells of a higher organism such as plants, particularly monocots and dicots, e.g. Z. mays or tobacco cells, may be used as the expression host cells. In some situations, it is desirable to express the DGAT gene in eukaryotic cells, where the DGAT protein will benefit from native folding and post-translational modifications. Small peptides can also be synthesized in the laboratory. Polypeptides that are subsets of the complete DGAT sequence may be used to identify and investigate parts of the protein important for function.

[0067] Specific expression systems of interest include bacterial, yeast, insect cell and mammalian cell derived expression systems. Representative systems from each of these categories is are provided below:

[0068] Bacteria. Expression systems in bacteria include those described in Chang et al., Nature (1978) 275:615; Goeddel et al., Nature (1979) 281:544; Goeddel et al., Nucleic Acids Res. (1980) 8:4057; EP 0 036,776; U.S. Pat. No. 4,551,433; DeBoer et al., Proc. Natl. Acad. Sci. (USA) (983) 80:21-25; and Siebenlist et al., Cell (1980) 20:269.

[0069] Yeast. Expression systems in yeast include those described in Hinnen et al., Proc. Natl. Acad. Sci. (USA) (1978) 75:1929; Ito et al., J. Bacteriol. (1983) 153:163; Kurtz et al., Mol. Cell. Biol (1986) 6:142; Kunze et al., J. Basic Microbiol. (1985) 25:141; Gleeson et al., J. Gen. Microbiol. (1986) 132-3459; Roggenkamp et al., Mol. Gen. Genet. (1986) 202:302; Das et al., J. Bacteriol. (1984) 158:1165; De Louvencourt et al., J. Bacteriol. (1983) 154.737; Van den Berg et al., Bio/Technology (1990) 8:135; Kunze et al., J. Basic Microbiol. (1985) 25:141; Cregg et al., Mol. Cell. Biol. (1985) 5:3376; U.S. Pat. Nos. 4,837,148 and 4,929,555; Beach and Nurse, Nature (1981) 300:706; Davidow et al., Curr. Genet. (1985) 10:380; Gaillardin et al., Curr. Genet. (1985) 10:49; Ballance et al., Biochem. Biophys. Res. Commun. (1983) 112:284-289; Tilburn et al., Gene (1983) 26:205-221; Yelton et al., Proc. Natl. Acad. Sci. (USA) (1984) 81:1470-1474; Kelly and Hynes, EMBO J. (1985) 4:475479; EP 0 244,234; and WO 91/00357.

[0070] Insect Cells. Expression of heterologous genes in insects is accomplished as described in U.S. Pat. No. 4,745,051; Friesen et al., “The Regulation of Baculovirus Gene Expression”, in: The Molecular Biology Of Baculoviruses (1986) (W. Doerfler, ed.); EP 0 127,839; EP 0 155,476; and Vlak et al., J Gen. Virol. (1988) 69:765-776; Miller et al., Ann. Rev. Microbiol. (1988) 42:177; Carbonell et al., Gene (1988) 73:409; Maeda et al., Nature (1985) 315:592-594; Lebacq-Verheyden et al., Mol. Cell. Biol. (1988) 8:3129; Smith et al., Proc. Natl. Acad. Sci. (USA) (1985) 82:8844; Miyajima et al., Gene (1987) 58:273; and Martin et al., DNA (1988) 7:99. Numerous baculoviral strains and variants and corresponding permissive insect host cells from hosts are described in Luckow et al., Bio/Technology (1988) 6:47-55, Miller et al., Generic Engineering (1986) 8:277-279, and Maeda et al., Nature (1985) 315:592-594.

[0071] Mammalian Cells. Mammalian expression is accomplished as described in Dijkema et al., EMBO J. (1985) 4:761, Gorman et al., Proc. Natl. Acad. Sci. (USA) (1982) 79:6777, Boshart et al., Cell (1985) 41:521 and U.S. Pat. No. 4,399,216. Other features of mammalian expression are facilitated as described in Ham and Wallace, Meth. Enz. (1979) 58:44, Barnes and Sato, Anal. Biochem. (1980) 102:255, U.S. Pat. Nos. 4,767,704, 4,657,866, 4,927,762, 4,560,655, WO 90/103430, WO 87/00195, and U.S. Pat. No. RE 30,985.

[0072] When any of the above host cells, or other appropriate host cells or organisms, are used to replicate and/or express the polynucleotides or nucleic acids of the invention, the resulting replicated nucleic acid, RNA, expressed protein or polypeptide, is within the scope of the invention as a product of the host cell or organism.

[0073] Once the source of the protein is identified and/or prepared, e.g. a transfected host expressing the protein is prepared, the protein is then purified to produce the desired DGAT comprising composition. Any convenient protein purification procedures may be employed, where suitable protein purification methodologies are described in Guide to Protein Purification, (Deuthser ed.) (Academic Press, 1990). For example, a lysate may prepared from the original source, e.g. naturally occurring cells or tissues that express DGAT or the expression host expressing DGAT, and purified using HPLC, exclusion chromatography, gel electrophoresis, affinity chromatography, and the like.

[0074] Once the gene corresponding to a selected polynucleotide is identified, its expression can be regulated in the cell to which the gene is native. For example, an endogenous gene of a cell can be regulated by an exogenous regulatory sequence introduced into the cell by site specific insertion, e.g., by homologous recombination. Further methods for modulating the expression of an endogenous gene include creating a library of plants each containing an endogenous modulating element (for example an activating T-DNA or transposable element) inserted at a different position in the plant's genome and screening the library for plants with modulated expression of the endogenous gene. A number of methods for modulating the expression of an endogenous gene of a cell are disclosed in U.S. Pat. Nos. 5,641,670, 5,968,502, 5,994,127, and 6,355,241, and in Plant Cell. (2000) 12: 2383-2394; Development (2000) 127: 4971-80; Plant Cell (2000) 12:1619-32; Science. (1999) 286:1962-5; Plant J. (1885) 359-365; Kempin et al. (1997) Nature 889:802-808; Plant Mol. Biol (2002) 48: 183-200, the disclosures of which is herein incorporated by reference.

[0075] Antibodies

[0076] Also of interest are antibodies that detect plant DGAT protein. Suitable antibodies are obtained by immunizing a host animal with peptides comprising all or a portion of a plant DGAT protein, such as the DGAT polypeptide compositions of the subject invention. Suitable host animals include mouse, rat sheep, goat, hamster, rabbit, etc. The origin of the protein immunogen may be monocot or dicot DGAT, from Arabidopsis, canola, nasturtium, tabacco or corn etc.

[0077] The immunogen may comprise the complete protein, or fragments and derivatives thereof. Preferred immunogens comprise all or a part of DGAT, where these residues contain the post-translation modifications, such as glycosylation, found on the native DGAT. Immunogens comprising the extracellular domain are produced in a variety of ways known in the art, e.g. expression of cloned genes using conventional recombinant methods, isolation from HEC, etc.

[0078] For preparation of polyclonal antibodies, the first step is immunization of the host animal with plant DGAT, where the DGAT will preferably be in substantially pure form, comprising less than about 1% contaminant. The immunogen may comprise complete DGAT, fragments or derivatives thereof. To increase the immune response of the host animal, the DGAT may be combined with an adjuvant, where suitable adjuvants include alum, dextran, sulfate, large polymeric anions, oil & water emulsions, e.g. Freund's adjuvant, Freund's complete adjuvant, and the like. The DGAT may also be conjugated to synthetic carrier proteins or synthetic antigens. A variety of hosts may be immunized to produce the polyclonal antibodies. Such hosts include rabbits, guinea pigs, rodents, e.g. mice, rats, sheep, goats, and the like. The DGAT is administered to the host, usually intradermally, with an initial dosage followed by one or more, usually at least two, additional booster dosages. Following immunization, the blood from the host will be collected, followed by separation of the serum from the blood cells. The Ig present in the resultant antiserum may be further fractionated using known methods, such as ammonium salt fractionation, DEAE chromatography, and the like.

[0079] Monoclonal antibodies are produced by conventional techniques. Generally, the spleen and/or lymph nodes of an immunized host animal provide a source of plasma cells. The plasma cells are immortalized by fusion with myeloma cells to produce hybridoma cells. Culture supernatant from individual hybridomas is screened using standard techniques to identify those producing antibodies with the desired specificity. Suitable animals for production of monoclonal antibodies to the human protein include mouse, rat, hamster, etc. To raise antibodies against the mouse protein, the animal will generally be a hamster, guinea pig, rabbit, etc. The antibody may be purified from the hybridoma cell supernatants or ascites fluid by conventional techniques, e.g. affinity chromatography using DGAT bound to an insoluble support, protein A sepharose, etc.

[0080] The antibody may be produced as a single chain, instead of the normal multimeric structure. Single chain antibodies are described in Jost et al. (1994) J.B.C. 269:26267B73, and others. DNA sequences encoding the variable region of the heavy chain and the variable region of the light chain are ligated to a spacer encoding at least about 4 amino acids of small neutral amino acids, including glycine and/or serine. The protein encoded by this fusion allows assembly of a functional variable region that retains the specificity and affinity of the original antibody.

[0081] Antibody fragments, such as Fv, F(ab′)₂ and Fab may be prepared by cleavage of the intact protein, e.g. by protease or chemical cleavage. Alternatively, a truncated gene is designed. For example, a chimeric gene encoding a portion of the F(ab′)₂ fragment would include DNA sequences encoding the CHl domain and hinge region of the H chain, followed by a translational stop codon to yield the truncated molecule.

[0082] Modifying DGAT Expression and/or Activity in a Plant

[0083] Also provided are methods of modifying DGAT expression and/or activity in a plant. In several embodiments, plant DGAT-encoding polynucleotides are used to modify DGAT activity in a plant to produce a variety of trait-modified plants, including plants with altered TAG levels, altered TAG compositions, or altered total seed protein content, particularly in seeds.

[0084] Several strategies may be employed to modify DGAT activity in a plant, including those that increase DGAT activity, and those that decrease DGAT activity.

[0085] Expression of a DGAT Transgene in a Plant

[0086] Typically, plant DGAT polynucleotide sequences of the invention are incorporated into recombinant nucleic acid, e.g. DNA or RNA, molecules that direct expression of polypeptides of the invention in appropriate host cells, transgenic plants, in vitro translation systems, or the like. Due to the inherent degeneracy of the genetic code, nucleic acid sequences which encode substantially the same or a functionally equivalent amino acid sequence can be substituted for any listed sequence to provide for cloning and expressing the relevant homologue.

[0087] The present invention includes recombinant constructs comprising one or more of the nucleic acid sequences herein. The constructs typically comprise a vector, such as a plasmid, a cosmid, a phage, a virus (e.g., a plant virus), a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), or the like, into which a nucleic acid sequence of the invention has been inserted, in a forward or reverse orientation. In a preferred aspect of this embodiment, the construct further comprises regulatory sequences, including, for example, a promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters are known to those of skill in the art, and are commercially available.

[0088] In general, expression plasmids will contain a selectable marker and DGAT nucleic acid sequences. The selectable marker provides resistance to toxic chemicals and allows selection of cells containing the marker over cells not containing the marker. Conveniently, the marker encodes resistance to a herbicide, e.g. phosphinothricin, glyphosate etc, or an antibiotic, such as kanamycin, G418, bleomycin, hygromycin, chloramphenicol, or the like. The particular marker employed is one that allows for selection of transformed cells over cells lacking the introduced recombinant DNA. Antibiotic or herbicide resistance markers including cat (chloramphenicol acetyl transferase), npt II (neomycin phosphotransferase II), PAT (phosphinothricin acetyltransferase), ALS (acetolactate synthetase), EPSPS (5-enolpyruvyl-shikimate-3-phosphate synthase), and bxn (bromoxynil-specific nitrilase) may be used. A preferred marker sequence is a DNA sequence encoding a selective marker for herbicide resistance and most particularly a protein having enzymatic activity capable of inactivating or neutralizing herbicidal inhibitors of glutamine synthetase. The non-selective herbicide known as glufosinate (BASTA™ or LIBERTY™) is an inhibitor of the enzyme glutamine synthetase. It has been found that naturally occurring genes or synthetic genes can encode the enzyme phosphinothricin acetyl transferase (PAT) responsible for the inactivation of the herbicide. Such genes have been isolated from Streptomyces. Specific species include Streptomyces hygroscopicus (Thompson C. J. et al., EMBO J., vol. 6:2519-2523 (1987)), Streptomyces coelicolor (Bedford et al, Gene 104: 39-45 (1991)) and Streptomyces viridochromogenes (Wohlleben et al. Gene 80:25-57 (1988)). These genes including those that have been isolated or synthesized are also frequently referred to as bar genes. These genes have been cloned and modified for transformation and expression in plants (EPA 469 273 and U.S. Pat. No. 5,561,236). Through the incorporation of the pat gene, corn plants and their offspring can become resistant against phosphinothricin (or glufosinate).

[0089] General texts that describe molecular biological techniques useful herein, including the use and production of vectors, promoters and many other relevant topics, include Berger, Sambrook and Ausubel, supra. Any of the identified sequences can be incorporated into a cassette or vector, e.g., for expression in plants. A number of expression vectors suitable for stable transformation of plant cells or for the establishment of transgenic plants have been described including those described in Weissbach and Weissbach, (1989) Methods for Plant Molecular Biology, Academic Press, and Gelvin et al., (1990) Plant Molecular Biology Manual, Kluwer Academic Publishers. Specific examples include those derived from a Ti plasmid of Agrobacterium tumefaciens, as well as those disclosed by Herrera-Estrella et al. (1983) Nature 303:209, Bevan (1984) Nucl Acid Res. 12: 8711-8721, Klee (1985) Bio/Technology 3: 637-642, for dicotyledonous plants.

[0090] Alternatively, non-Ti vectors can be used to transfer the DNA into monocotyledonous plants and cells by using free DNA delivery techniques. Such methods can involve, for example, the use of liposomes, electroporation, microprojectile bombardment, silicon carbide whiskers, and viruses. By using these methods transgenic plants such as wheat, rice (Christou(1991) Bio/Technology 9: 957-962) and corn (Gordon-Kamm (1990) Plant Cell 2: 603-618) can be produced. An immature embryo can also be a good target tissue for monocots for direct DNA delivery techniques by using the particle gun (Weeks et al. (1993) Plant Physiol 102: 1077-1084; Vasil (1993) Bio/Technology 10: 667-674; Wan and Lemeaux (1994) Plant Physiol 104: 37-48, and for Agrobacterium-mediated DNA transfer (Ishida et al. (1996) Nature Biotech 14: 745-750).

[0091] Typically, plant transformation vectors include one or more cloned plant coding sequence (genomic or cDNA) under the transcriptional control of 5′ and 3′ regulatory sequences and a dominant selectable marker. Such plant transformation vectors typically also contain a promoter (e.g., a regulatory region controlling inducible or constitutive, environmentally- or developmentally-regulated, or cell- or tissue-specific expression), a transcription initiation start site, an RNA processing signal (such as intron splice sites), a transcription termination site, and/or a polyadenylation signal.

[0092] Examples of constitutive plant promoters which can be useful for expressing a DGAT sequence include: the cauliflower mosaic virus (CaMV) 35S promoter, which confers constitutive, high-level expression in most plant tissues (see, e.g., Odel et al. (1985) Nature 313:810); the nopaline synthase promoter (An et al. (1988) Plant Physiol 88:547); and the octopine synthase promoter (Fromm et al. (1989) Plant cell 1:977).

[0093] A variety of plant gene promoters that regulate gene expression in response to environmental, hormonal, chemical, developmental signals, and in a tissue-active manner can be used for expression of DGAT sequence in plants. Choice of a promoter is based largely on the phenotype of interest and is determined by such factors as tissue (e.g., seed, fruit, root, pollen, vascular tissue, flower, carpet, etc.), inducibility (e.g., in response to wounding, heat, cold, drought, light, pathogens, etc.), timing, developmental state, and the like. Numerous known promoters have been characterized and can favorable be employed to promote expression of a polynucleotide of the invention in a transgenic plant or cell of interest. For example, fruit-specific promoters that are active during fruit ripening (such as the dru 1 promoter (U.S. Pat. No. 5,783,393), or the 2A11 promoter (U.S. Pat. No. 4,943,674) and the tomato polygalacturonase promoter (Bird et al. (1988) Plant Mol Biol 11:651), root-specific promoters, such as those disclosed in U.S. Pat. Nos. 5,618,988, 5,837,848 and 5,905,186, pollen-active promoters such as PTA29, PTA26 and PTA13 (U.S. Pat. No. 5,792,929), promoters active in vascular tissue (Ringli and Keller (1998) Plant Mol Biol 37:977-988), flower-specific (Kaiser et al, (1995) Plant Mol Biol 28:231-243), pollen (Baerson et al. (1994) Plant Mol Biol 26:1947-1959), carpels (Ohl et al. (1990) Plant Cell 2:837-848), pollen and ovules (Baerson et al. (1993) Plant Mol Biol 22L255-267), auxin-inducible promoters (such as that described in van der Kop et al. (1999) Plant Mol Biol 39:979-990 or Baumann et al. (1999) Plant Cell 11:323-334), cytokinin-inducible promoter (Guevara-Garcia (1998) Plant Mol Biol 38: 743-753), promoters responsive to gibberellin (Shi et al. (1998) Plant Mol Biol 38: 1053-1060, Willmott et al. (1998) 38:817-825) and the like. Additional promoters are those that elicit expression in response to heat (Ainley et al. (1993) Plant Mol Biol 22: 13-23), light (e.g., the pea rbcS-3A promoter, Kuhlemeier et al. (1989) Plant Cell 1:471, and the maize rbcS promoter, Schaffner and Sheen (1991) Plant Cell 3: 997); wounding (e.g., wunI, Siebertz et al. (1989) Plant Cell 1:961); pathogens (such as the PR-1 promoter described in Buchel et al. (1999) Plant Mol. Biol. 40:387-396, and the PDF1.2 promoter described in Manners et al. (1998) Plant Mol. Biol. 38: 1071-80), and chemicals such as methyl jasmonate or salicylic acid (Gatz et al. (1997) Plant Mol Biol 48: 89-108). In addition, the timing of the expression can be controlled by using promoters such as those acting at senescence (An and Amaxon (1995) Science 270: 1986-1988); or late seed development (Odell et al. (1994) Plant Physiol 106:447-458). For modification of seed traits, seed specific expression of DGAT is preferable. This can be accomplished using one of the many seed-specific promoters available to one of skill in the art, including the seed-specific promoters of the napin, phaseolin, cruciferin, oleosin, oleate 12-hydroxylase (Plant J. (1998) 13:201-10), DC3 (U.S. Pat. No. 5,773,697) genes, etc.

[0094] Plant expression vectors can also include RNA processing signals that can be positioned within, upstream or downstream of the coding sequence. In addition, the expression vectors can include additional regulatory sequences from the 3′-untranslated region of plant genes, e.g., a 3′ terminator region to increase mRNA stability of the mRNA, such as the PI-II terminator region of potato or the octopine or nopaline synthase 3′ terminator regions.

[0095] Specific initiation signals can aid in efficient translation of coding sequences. These signals can include, e.g., the ATG initiation codon and adjacent sequences. In cases where a coding sequence, its initiation codon and upstream sequences are inserted into the appropriate expression vector, no additional translational control signals may be needed. However, in cases where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is inserted, exogenous transcriptional control signals including the ATG initiation codon can be separately provided. The initiation codon is provided in the correct reading frame to facilitate transcription. Exogenous transcriptional elements and inititation codons can be of various origins, both natural and synthetic The efficiency of expression can be enhanced by the inclusion of enhancers appropriate to the cell system in use.

[0096] Reduction of Endogenous Gene Expression

[0097] It is often desirable to reduce the expression of DGAT in a plant to reduce the amount of TAG or increase the amount of total protein, particularly in seeds. The subject nucleic acids may be used in many methods to reduce DGAT activity, several of which methods are described below. Exemplary methods for reducing activity of DGAT plant are sub-divided into gene “silencing” and “knock-out” strategies.

[0098] Methods for gene silencing, including antisense, RNAi, ribozyme and cosuppression technologies are based in hybridization of an expressed exogenous nucleic acid with an RNA transcribed from an endogenous gene of interest in a plant cell. Because the methods are based on hybridization, the methods are particularly applicable to silencing of gene families that share a level of sequence identity, for example for families of genes that contain 60% or more, 70% or more, 80% or more, 90% or more or 95% or more sequence identity over 100, 200, or 500 or more nucleotides. RNA-induced silencing strategies for plants are reviewed in Matzke et al (Curr Opin Genet Dev. 2001 11:221-7). Such cells may be used to decrease the endogenous levels of DGAT in a plant.

[0099] Antisense, Cosuppression and RNAi Approaches

[0100] In addition to expression of the plant DGAT nucleic acids of the invention as plant phenotype modification nucleic acids, the nucleic acids are also useful for sense and anti-sense suppression of expression, e.g., to down-regulate expression of DGAT, e.g., as a further mechanism for modulating plant phenotype. That is, the nucleic acids of the invention, or subsequences or anti-sense sequences thereof, can be used to block expression of naturally occurring DGAT nucleic acids. A variety of sense and anti-sense technologies are known in the art, e.g., as set forth in Lichtenstein and Nellen (1997) Antisense Technology: A Practical Approach IRL Press at Oxford University, Oxford, England. In general, sense or antisense sequences are introduced into a cell, where they are optionally amplified, e.g., by transcription. Such sequences include both simple oligonucleotide sequences and catalytic sequences such as ribozymes.

[0101] For example, a reduction or elimination of expression (i.e., a “knock-out”) of a DGAT homologous gene in a transgenic plant, e.g., to decrease TAG levels, can be obtained by introducing an antisense construct containing a DGAT cDNA. For antisense suppression, the DGAT cDNA is arranged in reverse orientation (with respect to the coding sequence) relative to the promoter sequence in the expression vector. The introduced sequence need not be the full length cDNA or gene, and need not be identical to the cDNA or gene found in the plant type to be transformed. Typically, the antisense sequence need only be capable of hybridizing to the target gene or RNA of interest. Thus, where the introduced sequence is of shorter length, a higher degree of homology to the endogenous transcription factor sequence will be needed for effective antisense suppression. While antisense sequences of various lengths can be utilized, preferably, the introduced antisense sequence in the vector will be at least 30 nucleotides in length, and improved antisense suppression will typically be observed as the length of the antisense sequence increases. Preferably, the length of the antisense sequence in the vector will be greater that 100 nucleotides. Transcription of an antisense construct as described results in the production of RNA molecules that are the reverse complement of mRNA molecules transcribed from the endogenous transcription factor gene in the plant cell.

[0102] Suppression of endogenous DGAT gene expression can also be achieved using a ribozyme. Ribozymes are RNA molecules that possess highly specific endoribonuclease activity. The production and use of ribozymes are disclosed in U.S. Pat. No. 4,987,071 and U.S. Pat. No. 5,543,508. Synthetic ribozyme sequences including antisense RNAs can be used to confer RNA cleaving activity on the antisense RNA, such that endogenous mRNA molecules that hybridize to the antisense RNA are cleaved, which I turn leads to an enhanced antisense inhibition of endogenous gene expression.

[0103] Vectors in which RNA encoded by a DGAT cDNA is over-expressed can also be used to obtain co-suppression of a corresponding endogenous gene, e.g., in the manner described in U.S. Pat. No. 5,231,020 to Jorgensen. Such co-suppression (also termed sense suppression) does not require that the entire DGAT cDNA be introduced into the plant cells, nor does it require that the introduced sequence be exactly identical to the endogenous DGAT. However, as with antisense suppression, the suppressive efficiency will be enhanced as specificity of hybridization is increased, e.g., as the introduced sequence is lengthened, and/or as the sequence similarity between the introduced sequence and the endogenous transcription factor gene is increased.

[0104] Vectors expressing an untranslatable form of the DGAT mRNA, e.g., sequences comprising one or more stop codon, or nonsense mutation) can also be used to suppress expression of an endogenous DGAT, thereby reducing or eliminating it's activity and modifying one or more traits. Methods for producing such constructs are described in U.S. Pat. No. 5,583,021. Preferably, such constructs are made by introducing a premature stop codon into the DGAT gene.

[0105] Alternatively, DGAT gene expression can be modified by gene silencing using double-strand RNA (Sharp (1999) Genes and Development 13: 139-141). RNAi, otherwise known as double-stranded RNA interference (dsRNAi), has been extensively documented in the nematode C. elegans (Fire, A., et al, Nature, 391, 806-811, 1998) and an identical phenomenon occurs in plants, in which it is usually referred to as post-transcriptional gene silencing (PTGS) (Van Blokland, R., et al., Plant J., 6: 861-877, 1994; deCarvalho-Niebel, F., et at, Plant Cell, 7: 347-358, 1995; Jacobs, J. J. M. R. et al., Plant J., 12: 885-893, 1997; reviewed in Vaucheret, H., et al., Plant J., 16: 651-659, 1998). The phenomenon also occurs in fungi (Romano, N. and Masino, G., Mol. Microbiol., 6: 3343-3353, 1992, Cogoni, C., et al., EMBO J., 15: 3153-3163; Cogoni, C. and Masino, G., Nature, 399: 166-169, 1999), in which it is often referred to as “quelling”. RNAi silencing can be induced many ways in plants, where a nucleic acid encoding an RNA that forms a “hairpin” structure is employed in most embodiments. Alternative strategies include expressing RNA from each end of the encoding nucleic acid, making two RNA molecules that will hybridize. Current strategies for RNAi induced silencing in plants are reviewed by Carthew et al (Curr Opin Cell Biol. 2001 13:244-8).

[0106] Another method for abolishing the expression of a DGAT gene is by insertion mutagenesis using the T-DNA of Agrobacterium tumefaciens. After generating the insertion mutants, the mutants can be screened to identify those containing the insertion in a DGAT gene. Plants containing a single transgene insertion event at the desired gene can be crossed to generate homozygous plants for the mutation (Koncz et al. (1992) Methods in Arabidopsis Research, World Scientific).

[0107] Alternatively, a plant phenotype can be altered by eliminating an endogenous DGAT, e.g., by homologous recombination (Kempin et al. (1997) Nature 389:802). A DGAT gene can also be modified by using the cre-lox system (for example, as described in U.S. Pat. No. 5,658,772). A plant genome can be modified to include first and second lox sites that are then contacted with a Cre recombinase. If the lox sites are in the same orientation, the intervening DNA sequence between the two sites is excised. If the lox sites are in the opposite orientation, the intervening sequence is inverted.

[0108] The plant DGAT polynucleotides and polypeptides of this invention can also be expressed in a plant in the absence of an expression cassette by manipulating the activity or expression level of the endogenous gene by other means. For example, by ectopically expressing a gene by T-DNA activation tagging (Ichikawa et al. (1997) Nature 390 698-701; Kakimoto et al. (1996) Science 274:982-985). This method entails transforming a plant with a gene tag containing multiple transcriptional enhancers and once the tag has inserted into the genome, expression of a flanking gene coding sequence becomes deregulated. In another example, the transcriptional machinery in a plant can be modified so as to increase transcription levels of a polynucleotide of the invention (See, e.g., PCT Publication WO 96/06166 and WO 98/53057, which describe the modification of the DNA binding specificity of zinc finger proteins by changing particular amino acids in the DNA binding motif).

[0109] Plant Transformation

[0110] Transgenic plants (or plant cells, or plant explants, or plant tissues) incorporating the DGAT polynucleotides of the invention and/or expressing the DGAT polypeptides of the invention can be produced by a variety of well established techniques. Following construction of a vector, most typically an expression cassette, including a polynucleotide, e.g., encoding a DGAT or DGAT homolog of the invention, standard techniques can be used to introduce the polynucleotide into a plant, a plant cell, a plant explant or a plant tissue of interest. Optionally, the plant cell, explant or tissue can be regenerated to produce a transgenic plant.

[0111] The plant can be any higher plant, including gymnosperms, monocotyledonous and dicotyledenous plants. Suitable protocols are available for Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (carrot, celery, parsnip), Cruciferae (cabbage, radish, rapeseed, broccoli, etc.), Curcurbitaceae (melons and cucumber), Gramineae (wheat, corn, rice, barley, millet, etc.), Solanaceae (potato, tomato, tobacco, peppers, etc.), and various other crops. See protocols described in Ammirato et al. (1984) Handbook of Plant Cell Culture-Crop Species. Macmillan Publ. Co. Shimamoto et al. (1989) Nature 338:274-276, Fromm et al. (1990) Bio/Technology 8:833-839; and Vasil et al. (1990) Bio/Technology 8:429-434.

[0112] Transformation and regeneration of both monocotyledonous ad dicotyledonous plant cells is now routine, and the selection of the most appropriate transformation technique will be determined by the practitioner. The choice of method will vary with the type of plant to be transformed; those skilled in the art will recognize the suitability of particular methods for given plant types. Suitable methods can include, but are not limited to: electroporation of plant protoplast; liposome-mediated transformation; polyethylene glycol (PEG) mediated transformation; transformation using viruses: micro-injection of plant cells; micro-projectile bombardment of plant cells; vacuum infiltration; whiskers technology, and Agrobacterium tumeficiens mediated transformation. Transformation means introducing a nucleotide sequence into a plant in a manner to cause stable or transient expression of the sequence.

[0113] Successful examples of the modification of plant characteristics by transformation with cloned sequences which serve to illustrate the current knowledge in this field of technology, and which are herein incorporated by reference, include: U.S. Pat. Nos. 5,571,706; 5,677,175; 5,510,471; 5,750,386; 5,597,945; 5,589,615; 5,750,871; 5,268,526; 5,780,708; 5,538,880; 5,773,269; 5,736,369 and 5,610,042.

[0114] Following transformation, plants are preferably selected using a dominant selectable marker incorporated into the transformation vector. Typically, such a marker will confer antibiotic or herbicide resistance on the transformed plants, and selection of transformants can be accomplished by exposing the plants to appropriate concentrations of the antibiotic or herbicide.

[0115] After transformed plants are selected and grown to maturity, those plants showing a modified trait are identified. The modified trait can be any of those traits described below. Additionally, to confirm that the modified trait is due to changes in expression levels or activity of the polypeptide or polynucleotide of the invention can be determined by analyzing mRNA expression using Northern blots, RT-PCR or microarrays, or protein expression using immunoblots or Western blots or gel shift assays.

[0116] Utility

[0117] The above described methods and compositions find use in a variety of different applications, particularly in agricultural and food industries. In most embodiments, plant DGAT expression and/or activity is modified in a transgenic plant causing a modification in TAG composition or content of the transgenic plant, or a tissue thereof. Transgenic plants are usually processed, and used as an animal feedstuff, or may be used for human consumption, e.g., in a vegetable oil. In addition, TAG may be purified from transgenic plants and may be used as is or modified to be a component of many industrial compositions, such as biofuel, industrial lubricant, surfactant, paint, varnish, wax, soap, cosmetic and wax compositions. Plant DGAT may also be expressed in a non-plant host and used as an “industrial enzyme”, catalyzing the synthesis of TAG in vitro. Several such utilities are described in further detail below.

[0118] Of interest for use in producing triglyceride compositions are transgenic plant that have been genetically manipulated using the nucleic acid compositions of the subject invention to produce triglycerides and/or compositions thereof in one or more desirable ways. Transgenic plants of the subject invention are those plants that at least: (a) produce more triglyceride or triglyceride composition than wild type, e.g. produce more oil, such as by producing seeds having a higher oil content, as compared to wild-type; (b) produce less triglyceride or triglyceride composition than wild type, e.g. produce less oil, such as by producing seeds having a lower oil content, as compared to wild-type; (c) produce triglyceride compositions, e.g. oils, that are enriched for triglycerides and/or enriched for one or more particular triglycerides as compared to wild type; (d) produce more total seed protein as compared to wild type; (e) produce less total seed protein or seed protein composition as compared to wild type; and the like.

[0119] Of interest are transgenic plants that produce commercially valuable triglyceride compositions or oils, such as canola, rapeseed, palm, corn, etc., containing various poly- and mono-unsaturated fatty acids, and the like. Of particular interest are transgenic plants, such as canola, rapeseed, palm, oil, etc., which have been genetically modified to produce seeds having higher oil content than the content found in the corresponding wild type, where the oil content of the seeds produced by such plants is at least 10% higher, usually at least 20% higher, and in many embodiments at least 30% higher than that found in the wild type, where in many embodiments seeds having oil contents that are 50% higher, or even greater, as compared to seeds produced by the corresponding wild-type plant, are produced. The seeds produced by such DGAT transgenic plants can be used as sources of oil or as sources of additional DGAT transgenic plants. Such transgenic plants and seeds therefore find use in methods of producing oils. In such methods, DGAT transgenic plants engineered to produce seeds having a higher oil content than the corresponding wild-type, e.g. seeds in which the DGAT gene is overexpressed, are grown, the seeds are harvested and then processed to recover the oil. The subject transgenic plants can also be used to produce novel oils characterized by the presence of triglycerides in different amounts and/or ratios than those observed in naturally occurring oils. The transgenic plants described above can be readily produced by those of skill in the art armed with the nucleic acid compositions of the subject invention. Of particular interest are transgenic plants that overexpress DGAT exogenous DGAT proteins that have a different substrate specificity to the endogenous DGAT, allowing the alteration triglyceride composition. In these embodiments, e.g. the nasturtium DGAT polynucleotide (SEQ ID NO:6) that has some specificity to longer chain acyl molecules may be used to make longer chain triglycerides.

[0120] In many embodiments, the trait modifications of particular interest include those to seed (such as embryo or endosperm), fruit, root, flower, pericarp, leaf, stem, shoot, seedling, entire plant or the like

[0121] The triglyceride compositions described above find use in a variety of different applications. For example, such compositions or oils find use as food stuffs, being used as ingredients, spreads, cooking materials, etc. Alternatively, such oils find use as industrial feedstocks for use in the production of chemicals, lubricants, surfactants, paints, varnishes, and biofuels and the like.

[0122] Modulators of Plant DGAT Expression and/or Activity

[0123] The plant DGAT polynucleotides and polypeptides of the present invention may also be used to identify endogenous or exogenous molecules that can modulate DGAT expression and/or activity in a plant to produce a DGAT trait modified plant. In one embodiment, such molecules include organic and inorganic molecules that interact with the DGAT enzyme and modulate its activity, and in other embodiments, the small molecule may modulate the expression of DGAT by, e.g., modulating the expression of DGAT-encoding mRNA.

[0124] Endogenous molecules that interact with and modulate plant DGAT enzyme may be identified by any method for detecting covalent modification, e.g., phosphorylation, or for detecting protein-protein interactions, including co-immunoprecipitation, cross-linking, co-purification through gradients or chromatographic columns, or by the yeast two hybrid system (Chien et al, 1991, Proc. Natl. Acad. Sci., 88: 9578-9582). Endogenous molecules that modulate plant DGAT expression may be identified by measuring DGAT gene expression, for example using mRNA levels as determined by a microarray or northern blot hybridization, by measuring the amount of DGAT polypeptide or its activity, or by measuring the activity of a DGAT promoter, in a library of plants that have been genetically altered, e.g., by insertional (by T-DNA, transposons, etc) or chemical (EMS, X-ray, etc) mutagenesis. Once a plant with altered expression and/or DGAT activity is identified, the endogenous factor may be genetically mapped or otherwise determined, isolated and/or cloned.

[0125] In addition to methods for identifying endogenous DGAT modulatory molecules described above, the DGAT polynucleotides and polypeptides also provide methods for identifying exogenous molecules that modulate activity or expression of DGAT in plants. In this embodiment, a test agent, usually a small or large molecule, is placed in contact with a plant cell (or a tissue, explant, entire plant, or other composition containing a plant cell) and a resulting effect on the cell is evaluated by monitoring, either directly or indirectly, one or more of the expression level of the DGAT polynucleotide (e.g. by RNA blot hybridization or microarrays) or polypeptide (e.g. by western blotting) or DGAT activity (by activity assay). In many cases, an alteration of a plant DGAT trait can be detected following contact of the plant with a modulatory molecule.

[0126] The following examples are offered primarily for purposes of illustration. It will be readily apparent to those skilled in the art that the formulations, dosages, methods of administration, and other parameters of this invention may be further modified or substituted in various ways without departing from the spirit and scope of the invention.

EXPERIMENTAL Example I

[0127] Identification of DGAT cDNA from Arabidopsis thaliana.

[0128] The plant (Arabidopsis thaliana) DGAT gene (#AA042298) (SEQ ID NO:1) was identified from BLAST searches of the EST database using mouse DGAT sequences as a probe, as reported in U.S. patent application Ser. No. 09/103,754, the disclosure of which is herein incorporated by reference. The plant DGAT EST protein sequences encoded by plant DGAT genes are 40-50% identical to mammalianf DGAT enzymes. Furthermore, the plant DGAT sequences are more closely related to other mammalian DGAT sequences than to ACAT protein sequences.

Example II

[0129] Identification of Other Plant DGAT Polynucleotides and Polypeptides.

[0130] Using the Arabidopsis DGAT nucleic acid sequence described by SEQ ID NO:1 as a probe of the GenBank nucleotide sequence database using the TBLASTX (V2.2.3) and TBLASTN (V2.2.3), further plant DGAT polynucleotides and polypeptides were identified: Arabidopsis DGAT polynucleotide #AJ238008 (SEQ ID NO:2) and encoded polypeptide (SEQ ID NO:3), Brassica napus polynucleotide #AF251794 (SEQ ID NO:4) and encoded polypeptide (SEQ ID NO:5), B. napus polynucleotide #AF155224 (SEQ ID NO:6) and encoded polypeptide (SEQ ID NO:7), B. napus polynucleotide # AF164434 (SEQ ID NO:8) and encoded polypeptide (SEQ ID NO:9), Tropaeolum majus polynucleotide # AY084052 (SEQ ID NO:10) and encoded polypeptide (SEQ ID NO:11), Nicotiana tabacum polynucleotide #AF129003 (SEQ ID NO:12) and encoded polypeptide (SEQ ID NO:13), Perilla frutescens polynucleotide #AF298815 (SEQ ID NO:14) and encoded polypeptide (SEQ ID NO 15), Zea mays polynucleotides #AY110660 and #PCO148220 (SEQ ID NOS 16 and 17, respectively).

Example III

[0131] Increase of Oil Content in A. thaliana Seeds Expressing Plant DGAT Materials and Methods

[0132] Construction of DGAT cDNA Transformation Vector for Seed-Specific Expression: A full-length Arabidopsis thaliana DGAT cDNA (SEQ ID NO:2) is used as a template for PCR amplification with the primers DGATXbaI (CTAGTCTAGAATGGCGATTTTGGA) and DGATXhoI (GCGCTCGAGTTTCATGACATCGA) to provide new restriction sites on each end of the sequence. The PCR profile is as follows: 94° C. for 1 min; 30 cycles of 94° C. for 30 s, 55° C. for 30 s, 72° C. for 1 min; and 72° C. for 5 min. The PCR product is then ligated into the PCR-2.1 vector (Invitrogen, Carlsbad, Calif.). A 1.6-kb fragment is excised by a XbaI/KpnI digestion and ligated into the corresponding sites of the pSE. The plant transformation vector pSE is prepared from pRD400 (Datla et al., 1992 Gene 211: 383-384) by introducing a HindIII/XbaI fragment containing the B. napus napin promoter (Josefsson et al, 1987 J Biol Chem 262: 12196-12201) and a KpnI/EcoRI fragment containing the Agrobacterium nos terminator (Bevan, 1983 Acid Res 12: 8711-8721). The 1.6-kb DGAT cDNA fragment is ligated into XbaI/KpnI-digested pSE in the sense orientation. The resulting plasmid is designated napin:DGAT. Hence in the napin:DGAT construct, the Arabidopsis DGAT cDNA is under the control of the napin promoter. The construct integrity is confirmed by sequencing.

[0133] Transformation of Agrobacterium with Plant DGAT Vector Constructs: Electrocompetent Agrobacterium cells strain GV3101 (pMP90), are prepared and the Agrobacterium cells are transformed by electroporation with 20 to 50 ng of transforming DNA (napin:DGAT) according to the manufacturer's instructions, plated on a selective medium (Luria-Bertani broth with 50 μg mL1 kanamycin), and incubated for 48 h at 28° C. Single transformed cells are grown for 16 h (28° C., 225 rpm) in 5 mL Luria-Bertani broth with 50 μg mL1 kanamycin and 25 μg mL1 gentamycin. DNA extraction and purification are performed with a Qiaprep Spin Miniprep kit (Qiagen, Valencia, Calif.). The fidelity of the construct is rechecked by DNA sequencing before plant transformation.

[0134] Transformation of Arabidopsis: Seeds of Arabidopsis ecotype Columbia WT and mutant AS11 (Katavic et al., 1995) are grown at 22° C. under fluorescent illumination (120 μE m2 s1) in a 16-h-light/8-h-dark regime. Four to six plants typically are raised in a 10 cm2 pot in moistened Terra-lite Redi-earth (W. R. Grace and Company, Ajax, Ontario, Canada). To grow Agrobacterium, a 5-mL suspension in Luria-Bertani medium containing 50 μg mL1 kanamycin and 25 μg mL1 gentamycin is cultured overnight at 28° C. The day before infiltration, this “seed culture” is divided into four flasks containing 250 mL of Luria-Bertani medium supplemented with 50 μg mL1 kanamycin and 25 μg mL1 gentamycin. These culture are grown overnight at 28° C. Plants are vacuum infiltrated in an Agrobacterium suspension when the first flowers started opening.

[0135] The transformation is performed by vacuum infiltration using Silwet L-77 at a concentration of 0.005% in the dipping solution. The next day, the plants are uncovered, set upright, and allowed to grow for approximately 4 weeks in a growth chamber under continuous light conditions as described by Katavic et al. (1995). When the siliques are mature and dry, seeds are harvested and selected for positive transformants.

[0136] Selection of Putative Transformants (Transgenic Plants) and Analysis of Transgenic Plants and Seed Weights: For each construct, seeds are harvested in bulk. Seeds are surface-sterilized by submerging them in a solution containing 20% (v/v) bleach and 0.01% (v/v) Triton X-100 for 20 min, followed by three rinses with sterile water. Sterilized seeds are then plated by resuspending them in sterile 0.1% (w/v) phytagar at room temperature (approximately 1 mL phytagar for every 500-1,000 seeds), and then applying a volume containing 2,000 to 4,000 seeds onto 150×15-mm kanamycin selection plate. Plates are incubated for 2 d in the cold without light and then grown for 7 to 10 d in a controlled environment (22° C. under fluorescent illumination [120 μE m2 s1] in a 16-h-light/8-h-dark regime). The selection media contained one-half Murashige Skoog Gamborg medium, 0.8% (w/v) phytagar, 3% (w/v) Suc, 50 μg mL1 kanamycin, and 50 μg mL1 timentin. Petri dishes and lids are sealed with a Micropore surgical tape (3M Canada, Inc., London, ON, Canada). After 7 to 10 d, drug-resistant plants that had green leaves and well established roots within the medium are identified as T1 transformants, and at the 3- to 5-leaf stage selected transformants are transplanted into flats filled with heavily moistened soil mix. Transformants are grown to maturity, and mature seeds (T2 generation as defined in Katavic et al., 1994) are harvested from individual plants and further analyzed or propagated. Segregation analyses are performed on T2 plantlets screened on kanamycin to determine whether there are single (expected ratio of resistant:susceptible plantlets=3:1) or multiple copies of the napin:DGAT transgene. Homozygous single and multiple insert T2 lines exhibiting enhanced oil deposition and DGAT expression compared with one-dozen plasmid-only control transgenics are propagated to give T3 seed lines, for which further data on oil content, average seed weight, and yield per plant are collected. Average seed weights are determined from pooled T2 or individual T3 segregant seed lots and based upon six to eight individual samplings of 150 to 250 seeds/sample, with the seeds of each replicate being accurately counted on an Electronic Dual Light Transilluminator (Ultra Lum, Paramount, Calif.), using Scion Image software (Scion Corporation, Frederick, Md.). Weights and total oil content of the seeds of these samples are then individually recorded.

[0137] Results

[0138] The napin:DGAT plasmid is introduced into A. tumefaciens, used to transform wild-type Arabidopsis, and the progeny is analyzed. A number of primary napin:DGAT transgenic lines are produced, the T1 plantlets grown to maturity, and T2 seeds harvested. At the same time a number of independent plasmid only control transgenic (pSE vector without DGAT insert) lines, as well as non-transformed (n-t) WT and AS11 lines are propagated and analyzed.

[0139] On a mature seed dry weight basis, dry weight oil content of DGAT transgenic seeds is expected increase significantly.

Example IV

[0140] Reduction of Oil Content in Brassica Plants Expressing Antisense DGAT

[0141] The Brassica napus cDNA described by SEQ ID NO:4 is used to design two PCR primers for amplification. The primers are: DGAT1 (CATCATCATCATACTGCCATGGACAGGTGTGATTCTGCTFTTTTATCA; SEQ ID NO:20) and DGAT2 (CTACTACTACTACTACTAGAGACAGGGCAATGTAGAAAGTATGTA; SEQ ID NO:21). A fragment of the DGAT gene is amplified from B. napus cv Westar genomic as follows: each PCR amplification is carried out in a 100 μl PCR reaction mixture containing 50 ng of B. napus genomic DNA, 200 μm of each dNTP, 1X buffer B (Gibco BRL), 1 μm of each primer, 3 mM magnesium sulfate and 2 μl Elongase enzyme (Gibco BRL). The reaction mixture is denatured at 94° C. for 3 minutes, followed by 30 cycles of denaturation at 94° C. for 1 minute, annealing at 50° C. for 2 minutes and extension at 72° C. for 3 minutes. A final extension incubation is performed at 72° C. for 10 minutes after cycling. A 1.4 kb fragment is amplified.

[0142] The amplified DNA fragment is cloned into pAMP1 vector (Gibco BRL), then excised using SmaI and SnaBI and ligated, in an antisense orientation, into pMB110, containing the seed specific cruciferin promoter and cruciferin termination sequences. This construct us used to transform Agrobacterium strain LBA4404, and the transformed Agrobacterium is used to transform B. napus cv. Westar. Regenerated plantlets are transferred to a greenhouse and grown to maturity. Each plant is self pollinated and seeds are harvested from each plant.

[0143] TAG is expected decrease significantly in the seeds of many of plants containing the DGAT antisense construct plants. Since a decrease in TAG levels is usually correlated with an increase in total cellular seed protein, a corresponding increase in total seed protein is also expected for these transgenic plants.

[0144] A similar experiment using an RNA interference strategy where the DGAT is expressed as a hairpin structure is predicted to give a more significant decrease in TAG levels.

[0145] It is evident from the above results and discussion that the subject invention provides an important new means for modulating TAG levels in plants Specifically, the subject invention provides a system for increasing or decreasing the levels of TAG in a plant seed. As such, the subject methods and systems find use in a variety of different applications, including research, industry, and the production of animal and human feedstuffs. Accordingly, the present invention represents a significant contribution to the art.

[0146] All publications and patent applications cited in this specification are herein incorporated by reference as if each individual publication or patent application were specifically and individually indicated to be incorporated by reference. The citation of any publication is for its disclosure prior to the filing date and should not be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention.

[0147] Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it is readily apparent to those of ordinary skill in the art in light of the teachings of this invention that certain changes and modifications may be made thereto without departing from the spirit or scope of the appended claims.

0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 17 <210> SEQ ID NO 1 <211> LENGTH: 629 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: 455, 464, 467, 475, 497, 500, 508, 514, 519, 536, 543, 544, 576, 583, 584, 597 <223> OTHER INFORMATION: n = A,T,C or G <400> SEQUENCE: 1 tgcatgtata cggaagggtt gggtggctcg tcaatttgca aaactggtca tattcaccgg 60 attcatggga tttataatag aacaatatat aaatcctatt gtcaggaact caaagcatcc 120 tttgaaaggc gatcttctat atgctattga aagagtgttg aagctttcag ttccaaattt 180 atatgtgtgg ctctgcatgt tctactgctt cttccacctt tggttaaaca tattggcaga 240 gcttctctgc ttcggggatc gtgaattcta caaagattgg tggaatgcaa aaagtgtggg 300 agattactgg gagaatgtgg aatatgcctg tccataaatg ggatgggtcc gacatatata 360 ccttccccgt gcttgcgcac aaggattacc caaagacacc ccggccatta accattggct 420 ttcccaagcc ccctggaggc ctttccatgg gccanggacc cggngtnccc tggcnggccc 480 ttcaaagcaa agggggnttn cctggggnta aagntccang ggcccttggg gcccanccaa 540 aannttcccc cgggaaaggg ttgcccaccg gggggngaaa aanncccggg ggcaccncgg 600 aattttggga acccgggggg ggccttttt 629 <210> SEQ ID NO 2 <211> LENGTH: 1904 <212> TYPE: DNA <213> ORGANISM: Arabidopsis thaliana <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (139)...(1701) <400> SEQUENCE: 2 atttcttagc ttcttccttc aatccgctct ttccctctcc attagattct gtttcctctt 60 tcaatttctt ctgcatgctt ctcgattctc tctgacgcct cttttctccc gacgctgttt 120 cgtcaaacgc ttttcgaa atg gcg att ttg gat tct gct ggc gtt act acg 171 Met Ala Ile Leu Asp Ser Ala Gly Val Thr Thr 1 5 10 gtg acg gag aac ggt ggc gga gag ttc gtc gat ctt gat agg ctt cgt 219 Val Thr Glu Asn Gly Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg 15 20 25 cga cgg aaa tcg aga tcg gat tct tct aac gga ctt ctt ctc tct ggt 267 Arg Arg Lys Ser Arg Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly 30 35 40 tcc gat aat aat tct cct tcg gat gat gtt gga gct ccc gcc gac gtt 315 Ser Asp Asn Asn Ser Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val 45 50 55 agg gat cgg att gat tcc gtt gtt aac gat gac gct cag gga aca gcc 363 Arg Asp Arg Ile Asp Ser Val Val Asn Asp Asp Ala Gln Gly Thr Ala 60 65 70 75 aat ttg gcc gga gat aat aac ggt ggt ggc gat aat aac ggt ggt gga 411 Asn Leu Ala Gly Asp Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly 80 85 90 aga ggc ggc gga gaa gga aga gga aac gcc gat gct acg ttt acg tat 459 Arg Gly Gly Gly Glu Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr 95 100 105 cga ccg tcg gtt cca gct cat cgg agg gcg aga gag agt cca ctt agc 507 Arg Pro Ser Val Pro Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser 110 115 120 tcc gac gca atc ttc aaa cag agc cat gcc gga tta ttc aac ctc tgt 555 Ser Asp Ala Ile Phe Lys Gln Ser His Ala Gly Leu Phe Asn Leu Cys 125 130 135 gta gta gtt ctt att gct gta aac agt aga ctc atc atc gaa aat ctt 603 Val Val Val Leu Ile Ala Val Asn Ser Arg Leu Ile Ile Glu Asn Leu 140 145 150 155 atg aag tat ggt tgg ttg atc aga acg gat ttc tgg ttt agt tca aga 651 Met Lys Tyr Gly Trp Leu Ile Arg Thr Asp Phe Trp Phe Ser Ser Arg 160 165 170 tcg ctg cga gat tgg ccg ctt ttc atg tgt tgt ata tcc ctt tcg atc 699 Ser Leu Arg Asp Trp Pro Leu Phe Met Cys Cys Ile Ser Leu Ser Ile 175 180 185 ttt cct ttg gct gcc ttt acg gtt gag aaa ttg gta ctt cag aaa tac 747 Phe Pro Leu Ala Ala Phe Thr Val Glu Lys Leu Val Leu Gln Lys Tyr 190 195 200 ata tca gaa cct gtt gtc atc ttt ctt cat att att atc acc atg aca 795 Ile Ser Glu Pro Val Val Ile Phe Leu His Ile Ile Ile Thr Met Thr 205 210 215 gag gtt ttg tat cca gtt tac gtc acc cta agg tgt gat tct gct ttt 843 Glu Val Leu Tyr Pro Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe 220 225 230 235 tta tca ggt gtc act ttg atg ctc ctc act tgc att gtg tgg cta aag 891 Leu Ser Gly Val Thr Leu Met Leu Leu Thr Cys Ile Val Trp Leu Lys 240 245 250 ttg gtt tct tat gct cat act agc tat gac ata aga tcc cta gcc aat 939 Leu Val Ser Tyr Ala His Thr Ser Tyr Asp Ile Arg Ser Leu Ala Asn 255 260 265 gca gct gat aag gcc aat cct gaa gtc tcc tac tac gtt agc ttg aag 987 Ala Ala Asp Lys Ala Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys 270 275 280 agc ttg gca tat ttc atg gtc gct ccc aca ttg tgt tat cag cca agt 1035 Ser Leu Ala Tyr Phe Met Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser 285 290 295 tat cca cgt tct gca tgt ata cgg aag ggt tgg gtg gct cgt caa ttt 1083 Tyr Pro Arg Ser Ala Cys Ile Arg Lys Gly Trp Val Ala Arg Gln Phe 300 305 310 315 gca aaa ctg gtc ata ttc acc gga ttc atg gga ttt ata ata gaa caa 1131 Ala Lys Leu Val Ile Phe Thr Gly Phe Met Gly Phe Ile Ile Glu Gln 320 325 330 tat ata aat cct att gtc agg aac tca aag cat cct ttg aaa ggc gat 1179 Tyr Ile Asn Pro Ile Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp 335 340 345 ctt cta tat gct att gaa aga gtg ttg aag ctt tca gtt cca aat tta 1227 Leu Leu Tyr Ala Ile Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu 350 355 360 tat gtg tgg ctc tgc atg ttc tac tgc ttc ttc cac ctt tgg tta aac 1275 Tyr Val Trp Leu Cys Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn 365 370 375 ata ttg gca gag ctt ctc tgc ttc ggg gat cgt gaa ttc tac aaa gat 1323 Ile Leu Ala Glu Leu Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp 380 385 390 395 tgg tgg aat gca aaa agt gtg gga gat tac tgg aga atg tgg aat atg 1371 Trp Trp Asn Ala Lys Ser Val Gly Asp Tyr Trp Arg Met Trp Asn Met 400 405 410 cct gtt cat aaa tgg atg gtt cga cat ata tac ttc ccg tgc ttg cgc 1419 Pro Val His Lys Trp Met Val Arg His Ile Tyr Phe Pro Cys Leu Arg 415 420 425 agc aag ata cca aag aca ctc gcc att atc att gct ttc cta gtc tct 1467 Ser Lys Ile Pro Lys Thr Leu Ala Ile Ile Ile Ala Phe Leu Val Ser 430 435 440 gca gtc ttt cat gag cta tgc atc gca gtt cct tgt cgt ctc ttc aag 1515 Ala Val Phe His Glu Leu Cys Ile Ala Val Pro Cys Arg Leu Phe Lys 445 450 455 cta tgg gct ttt ctt ggg att atg ttt cag gtg cct ttg gtc ttc atc 1563 Leu Trp Ala Phe Leu Gly Ile Met Phe Gln Val Pro Leu Val Phe Ile 460 465 470 475 aca aac tat cta cag gaa agg ttt ggc tca acg gtg ggg aac atg atc 1611 Thr Asn Tyr Leu Gln Glu Arg Phe Gly Ser Thr Val Gly Asn Met Ile 480 485 490 ttc tgg ttc atc ttc tgc att ttc gga caa ccg atg tgt gtg ctt ctt 1659 Phe Trp Phe Ile Phe Cys Ile Phe Gly Gln Pro Met Cys Val Leu Leu 495 500 505 tat tac cac gac ctg atg aac cga aaa gga tcg atg tca tga 1701 Tyr Tyr His Asp Leu Met Asn Arg Lys Gly Ser Met Ser * 510 515 520 aacaactgtt caaaaaatga ctttcttcaa acatctatgg cctcgttgga tctccgttga 1761 tgttgtggtg gttctgatgc taaaacgaca aatagtgtta taaccattga agaagaaaag 1821 aaaattagag ttgttgtatc tgcaaaaatt ttggtagaga cacgcaaacc cgtttggatt 1881 ttgttatggt gtaaagcggc cgc 1904 <210> SEQ ID NO 3 <211> LENGTH: 520 <212> TYPE: PRT <213> ORGANISM: Arabidopsis thaliana <400> SEQUENCE: 3 Met Ala Ile Leu Asp Ser Ala Gly Val Thr Thr Val Thr Glu Asn Gly 1 5 10 15 Gly Gly Glu Phe Val Asp Leu Asp Arg Leu Arg Arg Arg Lys Ser Arg 20 25 30 Ser Asp Ser Ser Asn Gly Leu Leu Leu Ser Gly Ser Asp Asn Asn Ser 35 40 45 Pro Ser Asp Asp Val Gly Ala Pro Ala Asp Val Arg Asp Arg Ile Asp 50 55 60 Ser Val Val Asn Asp Asp Ala Gln Gly Thr Ala Asn Leu Ala Gly Asp 65 70 75 80 Asn Asn Gly Gly Gly Asp Asn Asn Gly Gly Gly Arg Gly Gly Gly Glu 85 90 95 Gly Arg Gly Asn Ala Asp Ala Thr Phe Thr Tyr Arg Pro Ser Val Pro 100 105 110 Ala His Arg Arg Ala Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe 115 120 125 Lys Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile 130 135 140 Ala Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp 145 150 155 160 Leu Ile Arg Thr Asp Phe Trp Phe Ser Ser Arg Ser Leu Arg Asp Trp 165 170 175 Pro Leu Phe Met Cys Cys Ile Ser Leu Ser Ile Phe Pro Leu Ala Ala 180 185 190 Phe Thr Val Glu Lys Leu Val Leu Gln Lys Tyr Ile Ser Glu Pro Val 195 200 205 Val Ile Phe Leu His Ile Ile Ile Thr Met Thr Glu Val Leu Tyr Pro 210 215 220 Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr 225 230 235 240 Leu Met Leu Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala 245 250 255 His Thr Ser Tyr Asp Ile Arg Ser Leu Ala Asn Ala Ala Asp Lys Ala 260 265 270 Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe 275 280 285 Met Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Ala 290 295 300 Cys Ile Arg Lys Gly Trp Val Ala Arg Gln Phe Ala Lys Leu Val Ile 305 310 315 320 Phe Thr Gly Phe Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile 325 330 335 Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile 340 345 350 Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys 355 360 365 Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu 370 375 380 Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys 385 390 395 400 Ser Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp 405 410 415 Met Val Arg His Ile Tyr Phe Pro Cys Leu Arg Ser Lys Ile Pro Lys 420 425 430 Thr Leu Ala Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu 435 440 445 Leu Cys Ile Ala Val Pro Cys Arg Leu Phe Lys Leu Trp Ala Phe Leu 450 455 460 Gly Ile Met Phe Gln Val Pro Leu Val Phe Ile Thr Asn Tyr Leu Gln 465 470 475 480 Glu Arg Phe Gly Ser Thr Val Gly Asn Met Ile Phe Trp Phe Ile Phe 485 490 495 Cys Ile Phe Gly Gln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu 500 505 510 Met Asn Arg Lys Gly Ser Met Ser 515 520 <210> SEQ ID NO 4 <211> LENGTH: 1537 <212> TYPE: DNA <213> ORGANISM: Brassica napus <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (2)...(1107) <400> SEQUENCE: 4 t taa aac cta act ctt tgc aaa tgg aga ttt tgg att ctg gag gcg tca 49 * Asn Leu Thr Leu Cys Lys Trp Arg Phe Trp Ile Leu Glu Ala Ser 1 5 10 15 cta tgc cga cgg aga acg gtg gtg ccg atc tcg ata cgc ttc gtc acc 97 Leu Cys Arg Arg Arg Thr Val Val Pro Ile Ser Ile Arg Phe Val Thr 20 25 30 gga aac cga gat cgg att ctt cca atg gac ttc ttc ctg att ccg taa 145 Gly Asn Arg Asp Arg Ile Leu Pro Met Asp Phe Phe Leu Ile Pro * 35 40 45 ctg ttt ccg atg ctg acg tga ggg atc ggg ttg att cag ctg ttg agg 193 Leu Phe Pro Met Leu Thr * Gly Ile Gly Leu Ile Gln Leu Leu Arg 50 55 60 ata ctc aag gaa aag cca att tgg ccg gag aaa acg aaa tta ggg aat 241 Ile Leu Lys Glu Lys Pro Ile Trp Pro Glu Lys Thr Lys Leu Gly Asn 65 70 75 ccg gtg gag aag cgg ggg gaa acg tgg atg taa ggt aca cgt atc ggc 289 Pro Val Glu Lys Arg Gly Glu Thr Trp Met * Gly Thr Arg Ile Gly 80 85 90 cgt cgg ttc cag ctc atc gga ggg tgc ggg aga gtc cac tca gct ctg 337 Arg Arg Phe Gln Leu Ile Gly Gly Cys Gly Arg Val His Ser Ala Leu 95 100 105 acg cca tct tca aac aga gcc atg ctg gac tat tca acc tgt gtg tag 385 Thr Pro Ser Ser Asn Arg Ala Met Leu Asp Tyr Ser Thr Cys Val * 110 115 120 tag ttc ttg ttg ctg taa aca gta gac tca tca tcg aaa atc tca tga 433 * Phe Leu Leu Leu * Thr Val Asp Ser Ser Ser Lys Ile Ser * 125 130 135 agt acg gtt ggt tga tca gaa ctg att tct ggt tta gtt caa cgt ctc 481 Ser Thr Val Gly * Ser Glu Leu Ile Ser Gly Leu Val Gln Arg Leu 140 145 150 tgc gag att ggc ccc ttt tca tgt gtt gtc tct ccc ttt caa tct ttc 529 Cys Glu Ile Gly Pro Phe Ser Cys Val Val Ser Pro Phe Gln Ser Phe 155 160 165 ctt tgg ctg cct tta ccg tcg aga aat tag tac ttc aga aat gca tat 577 Leu Trp Leu Pro Leu Pro Ser Arg Asn * Tyr Phe Arg Asn Ala Tyr 170 175 180 ctg aac ctg ttg tca tca ttc ttc ata tta tta tca cca tga ccg agg 625 Leu Asn Leu Leu Ser Ser Phe Phe Ile Leu Leu Ser Pro * Pro Arg 185 190 195 tct tgt atc cag tct atg tca ctc taa ggt gtg att ccg cct tct tat 673 Ser Cys Ile Gln Ser Met Ser Leu * Gly Val Ile Pro Pro Ser Tyr 200 205 210 cag gtg tca cgt tga tgc tcc tca ctt gca ttg tgt ggc tga agt tgg 721 Gln Val Ser Arg * Cys Ser Ser Leu Ala Leu Cys Gly * Ser Trp 215 220 225 ttt ctt acg ctc ata cta act atg aca taa gaa ccc tag cta att cat 769 Phe Leu Thr Leu Ile Leu Thr Met Thr * Glu Pro * Leu Ile His 230 235 240 ctg ata agg cca atc ctg aag tct cct act atg tta gct tga aga gct 817 Leu Ile Arg Pro Ile Leu Lys Ser Pro Thr Met Leu Ala * Arg Ala 245 250 255 tgg cgt att tca tgc ttg ctc cca cat tgt gtt atc agc cga gct atc 865 Trp Arg Ile Ser Cys Leu Leu Pro His Cys Val Ile Ser Arg Ala Ile 260 265 270 cac gtt ctc cat gta tcc gga agg gtt ggg tgg ctc gtc aat ttg caa 913 His Val Leu His Val Ser Gly Arg Val Gly Trp Leu Val Asn Leu Gln 275 280 285 agc tga tca tat tca ctg gat tca tgg gat tta taa tag agc aat ata 961 Ser * Ser Tyr Ser Leu Asp Ser Trp Asp Leu * * Ser Asn Ile 290 295 300 taa atc cta ttg tta gga act caa aac atc ctt tga aag ggg atc tct 1009 * Ile Leu Leu Leu Gly Thr Gln Asn Ile Leu * Lys Gly Ile Ser 305 310 tat acg gtg ttg aaa gag tgt tga agc ttt cag ttc caa att tat acg 1057 Tyr Thr Val Leu Lys Glu Cys * Ser Phe Gln Phe Gln Ile Tyr Thr 315 320 325 tgt ggc tct gca tgt tct act gct tct tcc acc ttt ggt taa aca tat 1105 Cys Gly Ser Ala Cys Ser Thr Ala Ser Ser Thr Phe Gly * Thr Tyr 330 335 340 tg gcagagctcc tctgcttcgg ggatcgtgaa ttctacaaag attggtggaa 1157 tgcaaaaagc gtgggagatt attggagaat gtggaatatg cctgttcata aatggatggt 1217 tcgacatgta tactttccgt gccttcgcag aaatataccg aaagtacccg ctattatcct 1277 tgctttctta gtctctgcag tctttcatga gttatgcatc gcagttcctt gtcgtctctt 1337 caaactatgg gctttcttgg ggattatgtt tcaggtgcct ttggtattta tcacaaacta 1397 cctacaagaa aggtttggct ccatggtggg aaacatgata ttctggttta ccttctgcat 1457 tttcggacaa ccgatgtgtg tgcttcttta ttatcacgac ttgatgaacc gcaaaggaaa 1517 gatgtcatag atgtgtatgt 1537 <210> SEQ ID NO 5 <211> LENGTH: 344 <212> TYPE: PRT <213> ORGANISM: Brassica napus <400> SEQUENCE: 5 Asn Leu Thr Leu Cys Lys Trp Arg Phe Trp Ile Leu Glu Ala Ser Leu 1 5 10 15 Cys Arg Arg Arg Thr Val Val Pro Ile Ser Ile Arg Phe Val Thr Gly 20 25 30 Asn Arg Asp Arg Ile Leu Pro Met Asp Phe Phe Leu Ile Pro Leu Phe 35 40 45 Pro Met Leu Thr Gly Ile Gly Leu Ile Gln Leu Leu Arg Ile Leu Lys 50 55 60 Glu Lys Pro Ile Trp Pro Glu Lys Thr Lys Leu Gly Asn Pro Val Glu 65 70 75 80 Lys Arg Gly Glu Thr Trp Met Gly Thr Arg Ile Gly Arg Arg Phe Gln 85 90 95 Leu Ile Gly Gly Cys Gly Arg Val His Ser Ala Leu Thr Pro Ser Ser 100 105 110 Asn Arg Ala Met Leu Asp Tyr Ser Thr Cys Val Phe Leu Leu Leu Thr 115 120 125 Val Asp Ser Ser Ser Lys Ile Ser Ser Thr Val Gly Ser Glu Leu Ile 130 135 140 Ser Gly Leu Val Gln Arg Leu Cys Glu Ile Gly Pro Phe Ser Cys Val 145 150 155 160 Val Ser Pro Phe Gln Ser Phe Leu Trp Leu Pro Leu Pro Ser Arg Asn 165 170 175 Tyr Phe Arg Asn Ala Tyr Leu Asn Leu Leu Ser Ser Phe Phe Ile Leu 180 185 190 Leu Ser Pro Pro Arg Ser Cys Ile Gln Ser Met Ser Leu Gly Val Ile 195 200 205 Pro Pro Ser Tyr Gln Val Ser Arg Cys Ser Ser Leu Ala Leu Cys Gly 210 215 220 Ser Trp Phe Leu Thr Leu Ile Leu Thr Met Thr Glu Pro Leu Ile His 225 230 235 240 Leu Ile Arg Pro Ile Leu Lys Ser Pro Thr Met Leu Ala Arg Ala Trp 245 250 255 Arg Ile Ser Cys Leu Leu Pro His Cys Val Ile Ser Arg Ala Ile His 260 265 270 Val Leu His Val Ser Gly Arg Val Gly Trp Leu Val Asn Leu Gln Ser 275 280 285 Ser Tyr Ser Leu Asp Ser Trp Asp Leu Ser Asn Ile Ile Leu Leu Leu 290 295 300 Gly Thr Gln Asn Ile Leu Lys Gly Ile Ser Tyr Thr Val Leu Lys Glu 305 310 315 320 Cys Ser Phe Gln Phe Gln Ile Tyr Thr Cys Gly Ser Ala Cys Ser Thr 325 330 335 Ala Ser Ser Thr Phe Gly Thr Tyr 340 <210> SEQ ID NO 6 <211> LENGTH: 1446 <212> TYPE: DNA <213> ORGANISM: Brassica napus <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (82)...(1107) <400> SEQUENCE: 6 cgaaaatctc atgaagtacg gttggttgat cagaactgat ttctggttta gttcaacgtc 60 gctgcgagat tgccgctttt c atg tgt tgt ctc tcc ctt tca atc ttt cct 111 Met Cys Cys Leu Ser Leu Ser Ile Phe Pro 1 5 10 ttg gct gcc ttt acc gtc gag aaa tta gta ctt cag aaa tgc ata tct 159 Leu Ala Ala Phe Thr Val Glu Lys Leu Val Leu Gln Lys Cys Ile Ser 15 20 25 gaa cct gtt gtc atc ttt ctt cat gtt att atc acc atg acc gag gtc 207 Glu Pro Val Val Ile Phe Leu His Val Ile Ile Thr Met Thr Glu Val 30 35 40 ttg tat cca gtc tat gtc act cta agg tgt gat tct gcc ttc tta tca 255 Leu Tyr Pro Val Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser 45 50 55 ggt gac acg ttg atg ctc ctc act tgc att gtg tgg ctg aag ttg gtt 303 Gly Asp Thr Leu Met Leu Leu Thr Cys Ile Val Trp Leu Lys Leu Val 60 65 70 tct tac gct cat act aac tat gac ata aga acc cta gct aat tca tct 351 Ser Tyr Ala His Thr Asn Tyr Asp Ile Arg Thr Leu Ala Asn Ser Ser 75 80 85 90 gat aag gcc aat cct gaa gtc tcc tac tat gtt agc ttg aag agc ttg 399 Asp Lys Ala Asn Pro Glu Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu 95 100 105 gct tat ttc atg ctt gct ccc aca ttg tgt tat cag cca agc tat cca 447 Ala Tyr Phe Met Leu Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro 110 115 120 cgt tct cca tgt atc cgg aag ggt tgg gtg gct cgt caa ttt gca aaa 495 Arg Ser Pro Cys Ile Arg Lys Gly Trp Val Ala Arg Gln Phe Ala Lys 125 130 135 ctg gtc ata ttc act gga ctc atg gga ttt ata ata gag caa tat ata 543 Leu Val Ile Phe Thr Gly Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile 140 145 150 aat cct att gtt agg aac tca aag cat cct ctg aaa ggg gac ctt cta 591 Asn Pro Ile Val Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu 155 160 165 170 tat gct att gaa aga gtg ttg aag ctt tca gtt cca aat cta tat gtg 639 Tyr Ala Ile Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val 175 180 185 tgg ctc tgc atg ttc tac tgc ttc ttc cac ctt tgg tta aac ata ttg 687 Trp Leu Cys Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu 190 195 200 gca gag ctc ctc tgc ttc ggg gac cgt gaa ttc tac aaa gat tgg tgg 735 Ala Glu Leu Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp 205 210 215 aat gca aaa agc gtt gga gat tat tgg aga atg tgg aat atg cct gtt 783 Asn Ala Lys Ser Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val 220 225 230 cac aaa tgg atg gtt cga cat gta tac ttt ccg tgc ctg cgc atc aag 831 His Lys Trp Met Val Arg His Val Tyr Phe Pro Cys Leu Arg Ile Lys 235 240 245 250 ata cca aaa gta ccc gcc att atc att gct ttc tta gtc tct gca gtc 879 Ile Pro Lys Val Pro Ala Ile Ile Ile Ala Phe Leu Val Ser Ala Val 255 260 265 ttt cat gag tta tgc atc gca gtt cct tgc cgt ctc ttc aat cta tgg 927 Phe His Glu Leu Cys Ile Ala Val Pro Cys Arg Leu Phe Asn Leu Trp 270 275 280 gct ttc atg gga att atg ttt cag gtc cct ttg gtc ttt atc aca aac 975 Ala Phe Met Gly Ile Met Phe Gln Val Pro Leu Val Phe Ile Thr Asn 285 290 295 ttt tta caa gaa agg ttt ggc tcc atg gtg gga aac atg atc ttt ggt 1023 Phe Leu Gln Glu Arg Phe Gly Ser Met Val Gly Asn Met Ile Phe Gly 300 305 310 tca gct tct tgc att ttc gga caa ccg atg tgt ggg ctt ctt tat tac 1071 Ser Ala Ser Cys Ile Phe Gly Gln Pro Met Cys Gly Leu Leu Tyr Tyr 315 320 325 330 cat gac ctg atg aac cgc aaa gga tcc atg tcc tga aaaggacttt 1117 His Asp Leu Met Asn Arg Lys Gly Ser Met Ser * 335 340 ttacgcccca aaaaaaaaat tggtcaattg gaaaaatggg agtttttgta tccttttggt 1177 agccgttaaa atgcctttaa aaagacgaat cctttggagt tcttgtttct cttggtctct 1237 gtcccccacg ggattttcta tttctcgtct tttaacaagc ccataaaaaa aagtagactg 1297 agataattgg attttgttat gctgtaaaaa aaatttcatt caaaaatgtt tgaataatct 1357 ttgacgattc ccaaaatccc gagaaaaata aaagtaagcc tttccttttt aaaaaaaaaa 1417 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 1446 <210> SEQ ID NO 7 <211> LENGTH: 341 <212> TYPE: PRT <213> ORGANISM: Brassica napus <400> SEQUENCE: 7 Met Cys Cys Leu Ser Leu Ser Ile Phe Pro Leu Ala Ala Phe Thr Val 1 5 10 15 Glu Lys Leu Val Leu Gln Lys Cys Ile Ser Glu Pro Val Val Ile Phe 20 25 30 Leu His Val Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val Tyr Val 35 40 45 Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Asp Thr Leu Met Leu 50 55 60 Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His Thr Asn 65 70 75 80 Tyr Asp Ile Arg Thr Leu Ala Asn Ser Ser Asp Lys Ala Asn Pro Glu 85 90 95 Val Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met Leu Ala 100 105 110 Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Pro Cys Ile Arg 115 120 125 Lys Gly Trp Val Ala Arg Gln Phe Ala Lys Leu Val Ile Phe Thr Gly 130 135 140 Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val Arg Asn 145 150 155 160 Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu Arg Val 165 170 175 Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr 180 185 190 Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu Cys Phe 195 200 205 Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser Val Gly 210 215 220 Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg 225 230 235 240 His Val Tyr Phe Pro Cys Leu Arg Ile Lys Ile Pro Lys Val Pro Ala 245 250 255 Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu Cys Ile 260 265 270 Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly Ile Met 275 280 285 Phe Gln Val Pro Leu Val Phe Ile Thr Asn Phe Leu Gln Glu Arg Phe 290 295 300 Gly Ser Met Val Gly Asn Met Ile Phe Gly Ser Ala Ser Cys Ile Phe 305 310 315 320 Gly Gln Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met Asn Arg 325 330 335 Lys Gly Ser Met Ser 340 <210> SEQ ID NO 8 <211> LENGTH: 1512 <212> TYPE: DNA <213> ORGANISM: Brassica napus <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)...(1512) <400> SEQUENCE: 8 atg gcg att ttg gat tct gga ggc gtc gct gta ccg ccg acg gag aac 48 Met Ala Ile Leu Asp Ser Gly Gly Val Ala Val Pro Pro Thr Glu Asn 1 5 10 15 ggc gtc gcg gat ctc gac agg ctc cac cgt cgt aaa tcg agt tcg gat 96 Gly Val Ala Asp Leu Asp Arg Leu His Arg Arg Lys Ser Ser Ser Asp 20 25 30 tct tcc aac gga ctc ctc tcc gat act tcc ccg tcg gac gat gtt gga 144 Ser Ser Asn Gly Leu Leu Ser Asp Thr Ser Pro Ser Asp Asp Val Gly 35 40 45 gct gcg gcg gcc gaa agg gat cgg gtt gat tcc gct gcc gag gag gag 192 Ala Ala Ala Ala Glu Arg Asp Arg Val Asp Ser Ala Ala Glu Glu Glu 50 55 60 gct cag gga aca gcg aat tta gct ggc gga gat gcc gaa act agg gaa 240 Ala Gln Gly Thr Ala Asn Leu Ala Gly Gly Asp Ala Glu Thr Arg Glu 65 70 75 80 tcc gcc gga ggc gat gta agg ttt acg tat cga ccg tcg gtt cca gct 288 Ser Ala Gly Gly Asp Val Arg Phe Thr Tyr Arg Pro Ser Val Pro Ala 85 90 95 cat cgg agg acg agg gag agt cct ctc agc tcc gac gct atc ttc aaa 336 His Arg Arg Thr Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe Lys 100 105 110 caa agc cat gca gga ttg ttc aac ctc tgt gta gtt gtt ctt gtt gct 384 Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Val Ala 115 120 125 gtt aac agt aga ctc atc atc gaa aac ctc atg aag tat ggt tgg ttg 432 Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp Leu 130 135 140 atc aga act gat ttt tgg ttt agt tct aca tcc tta cga gac tgg ccg 480 Ile Arg Thr Asp Phe Trp Phe Ser Ser Thr Ser Leu Arg Asp Trp Pro 145 150 155 160 ctt ttc atg tgt tgt ctt tca ctt tcg gtc ttt cct ttg gct gcc ttc 528 Leu Phe Met Cys Cys Leu Ser Leu Ser Val Phe Pro Leu Ala Ala Phe 165 170 175 acg gtc gag aaa atg gta ctt cag aaa ttc ata tct gag cct gtt gcc 576 Thr Val Glu Lys Met Val Leu Gln Lys Phe Ile Ser Glu Pro Val Ala 180 185 190 atc att ctt cat gtc att ata acc atg aca gag gtc ttg tat cca gtc 624 Ile Ile Leu His Val Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val 195 200 205 tac gtc aca ctg agg tgt gat tct gcc ttc ttg tca ggt gtc acg ttg 672 Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr Leu 210 215 220 atg ctg ctc act tgc att gtg tgg ctg aag ttg gtt tct tac gct cat 720 Met Leu Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His 225 230 235 240 act agc tac gac ata aga acc ctg gcc aat tca gct gat aag gtc gat 768 Thr Ser Tyr Asp Ile Arg Thr Leu Ala Asn Ser Ala Asp Lys Val Asp 245 250 255 cct gaa atc tcc tac tat gtt agc ttg aag agc ttg gcg tat ttc atg 816 Pro Glu Ile Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met 260 265 270 gtt gct ccc aca ctg tgt tat cag cca agc tat cca cgt tct cca tgt 864 Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Pro Cys 275 280 285 atc cgg aag ggt tgg gtg gct cgt caa ctt gca aaa ctg gtc ata ttc 912 Ile Arg Lys Gly Trp Val Ala Arg Gln Leu Ala Lys Leu Val Ile Phe 290 295 300 act gga ctc atg gga ttt ata ata gag caa tat ata aat cct att gtt 960 Thr Gly Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val 305 310 315 320 agg aac tca aag cat cct ctg aaa ggg gac ctt cta tat gct att gaa 1008 Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu 325 330 335 aga gtg ttg aag ctt tca gtt cca aat cta tat gtg tgg ctc tgc atg 1056 Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met 340 345 350 ttc tac tgc ttc ttc cac ctt tgg tta aac ata ttg gca gag ctc ctc 1104 Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu 355 360 365 tgc ttc ggg gac cgt gaa ttc tac aaa gat tgg tgg aat gca aaa agc 1152 Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser 370 375 380 gtt gga gat tat tgg aga atg tgg aat atg cct gtt cac aaa tgg atg 1200 Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met 385 390 395 400 gtt cga cat gta tac ttt ccg tgc ctg cgc atc aag ata cca aaa gta 1248 Val Arg His Val Tyr Phe Pro Cys Leu Arg Ile Lys Ile Pro Lys Val 405 410 415 ccc gcc att atc att gct ttc tta gtc tct gca gtc ttt cat gag tta 1296 Pro Ala Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu 420 425 430 tgc atc gca gtt cct tgc cgt ctc ttc aat cta tgg gct ttc atg gga 1344 Cys Ile Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly 435 440 445 att atg ttt cag gtc cct ttg gtc ttt atc aca aac ttt tta caa gaa 1392 Ile Met Phe Gln Val Pro Leu Val Phe Ile Thr Asn Phe Leu Gln Glu 450 455 460 agg ttt ggc tcc atg gtg gga aac atg atc ttt ggt tca gct tct tgc 1440 Arg Phe Gly Ser Met Val Gly Asn Met Ile Phe Gly Ser Ala Ser Cys 465 470 475 480 att ttc gga caa ccg atg tgt ggg ctt ctt tat tac cat gac ctg atg 1488 Ile Phe Gly Gln Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met 485 490 495 aac cgc aaa gga tcc atg tcc tga 1512 Asn Arg Lys Gly Ser Met Ser * 500 <210> SEQ ID NO 9 <211> LENGTH: 503 <212> TYPE: PRT <213> ORGANISM: Brassica napus <400> SEQUENCE: 9 Met Ala Ile Leu Asp Ser Gly Gly Val Ala Val Pro Pro Thr Glu Asn 1 5 10 15 Gly Val Ala Asp Leu Asp Arg Leu His Arg Arg Lys Ser Ser Ser Asp 20 25 30 Ser Ser Asn Gly Leu Leu Ser Asp Thr Ser Pro Ser Asp Asp Val Gly 35 40 45 Ala Ala Ala Ala Glu Arg Asp Arg Val Asp Ser Ala Ala Glu Glu Glu 50 55 60 Ala Gln Gly Thr Ala Asn Leu Ala Gly Gly Asp Ala Glu Thr Arg Glu 65 70 75 80 Ser Ala Gly Gly Asp Val Arg Phe Thr Tyr Arg Pro Ser Val Pro Ala 85 90 95 His Arg Arg Thr Arg Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe Lys 100 105 110 Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Val Ala 115 120 125 Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Trp Leu 130 135 140 Ile Arg Thr Asp Phe Trp Phe Ser Ser Thr Ser Leu Arg Asp Trp Pro 145 150 155 160 Leu Phe Met Cys Cys Leu Ser Leu Ser Val Phe Pro Leu Ala Ala Phe 165 170 175 Thr Val Glu Lys Met Val Leu Gln Lys Phe Ile Ser Glu Pro Val Ala 180 185 190 Ile Ile Leu His Val Ile Ile Thr Met Thr Glu Val Leu Tyr Pro Val 195 200 205 Tyr Val Thr Leu Arg Cys Asp Ser Ala Phe Leu Ser Gly Val Thr Leu 210 215 220 Met Leu Leu Thr Cys Ile Val Trp Leu Lys Leu Val Ser Tyr Ala His 225 230 235 240 Thr Ser Tyr Asp Ile Arg Thr Leu Ala Asn Ser Ala Asp Lys Val Asp 245 250 255 Pro Glu Ile Ser Tyr Tyr Val Ser Leu Lys Ser Leu Ala Tyr Phe Met 260 265 270 Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Pro Cys 275 280 285 Ile Arg Lys Gly Trp Val Ala Arg Gln Leu Ala Lys Leu Val Ile Phe 290 295 300 Thr Gly Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val 305 310 315 320 Arg Asn Ser Lys His Pro Leu Lys Gly Asp Leu Leu Tyr Ala Ile Glu 325 330 335 Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met 340 345 350 Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu 355 360 365 Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Ser 370 375 380 Val Gly Asp Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met 385 390 395 400 Val Arg His Val Tyr Phe Pro Cys Leu Arg Ile Lys Ile Pro Lys Val 405 410 415 Pro Ala Ile Ile Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu 420 425 430 Cys Ile Ala Val Pro Cys Arg Leu Phe Asn Leu Trp Ala Phe Met Gly 435 440 445 Ile Met Phe Gln Val Pro Leu Val Phe Ile Thr Asn Phe Leu Gln Glu 450 455 460 Arg Phe Gly Ser Met Val Gly Asn Met Ile Phe Gly Ser Ala Ser Cys 465 470 475 480 Ile Phe Gly Gln Pro Met Cys Gly Leu Leu Tyr Tyr His Asp Leu Met 485 490 495 Asn Arg Lys Gly Ser Met Ser 500 <210> SEQ ID NO 10 <211> LENGTH: 2090 <212> TYPE: DNA <213> ORGANISM: Tropaeolum majus <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (171)...(1727) <400> SEQUENCE: 10 acgcggggag ttttcaaaat catattatgc tttttcttca ctactgcatg aactttcttt 60 ctacttcttg caactgattt gtaatcctta cacatgtttc tagttttctc catataaaaa 120 aaatattctc tgagcttctc gattctctag agagagaagg ccaaaaaaaa atg gcg 176 Met Ala 1 gtg gca gag tcg tca cag aac acg aca acc atg agt ggt cac ggc gac 224 Val Ala Glu Ser Ser Gln Asn Thr Thr Thr Met Ser Gly His Gly Asp 5 10 15 tcg gat ctc aac aat ttc cgt aga agg aaa ccg agt tcc tcc gtg att 272 Ser Asp Leu Asn Asn Phe Arg Arg Arg Lys Pro Ser Ser Ser Val Ile 20 25 30 gaa cct tcg tcg tcc ggt ttt aca tcc acc aat ggc gta ccg gcg act 320 Glu Pro Ser Ser Ser Gly Phe Thr Ser Thr Asn Gly Val Pro Ala Thr 35 40 45 50 ggc cac gtg gct gag aat cgt gac cag gat cgg gta ggg gct atg gag 368 Gly His Val Ala Glu Asn Arg Asp Gln Asp Arg Val Gly Ala Met Glu 55 60 65 aac gca aca gga tcg gtc aac tta att gga aat ggt gga ggc gtg gtt 416 Asn Ala Thr Gly Ser Val Asn Leu Ile Gly Asn Gly Gly Gly Val Val 70 75 80 atc ggg aat gaa gag aaa cag gta ggg gag act gat ata cga ttc act 464 Ile Gly Asn Glu Glu Lys Gln Val Gly Glu Thr Asp Ile Arg Phe Thr 85 90 95 tac cgg cct tcg ttt ccg gct cat cgg agg gtg agg gag agt cct ctt 512 Tyr Arg Pro Ser Phe Pro Ala His Arg Arg Val Arg Glu Ser Pro Leu 100 105 110 agc tct gat gca atc ttc aaa cag agc cat gcg ggt tta ttc aac ttg 560 Ser Ser Asp Ala Ile Phe Lys Gln Ser His Ala Gly Leu Phe Asn Leu 115 120 125 130 tgt ata gta gtg ctc att gca gta aac agt agg ctt atc atc gaa aat 608 Cys Ile Val Val Leu Ile Ala Val Asn Ser Arg Leu Ile Ile Glu Asn 135 140 145 ctt atg aag tat ggt tgg ttg atc gat act ggt ttc tgg ttt aac tca 656 Leu Met Lys Tyr Gly Trp Leu Ile Asp Thr Gly Phe Trp Phe Asn Ser 150 155 160 aga tca ctg ggt gat tgg tcc atc ttt atg tgc tgt ctt aca ctc cca 704 Arg Ser Leu Gly Asp Trp Ser Ile Phe Met Cys Cys Leu Thr Leu Pro 165 170 175 att ttc cca ctt gct gct ttt att gtt gaa aag ctg gtg cag cga aat 752 Ile Phe Pro Leu Ala Ala Phe Ile Val Glu Lys Leu Val Gln Arg Asn 180 185 190 cat ata tct gaa ctt gtt gct gtt ctc ctt cat gta atc gtt tct acc 800 His Ile Ser Glu Leu Val Ala Val Leu Leu His Val Ile Val Ser Thr 195 200 205 210 gct gca gtt tta tat cca gtt att gtg atc tta acg tgt gat tcg gtg 848 Ala Ala Val Leu Tyr Pro Val Ile Val Ile Leu Thr Cys Asp Ser Val 215 220 225 tat atg tct ggt gtg gta ttg atg ctc ttt ggt tgc att atg tgg ttg 896 Tyr Met Ser Gly Val Val Leu Met Leu Phe Gly Cys Ile Met Trp Leu 230 235 240 aag ctg gtg tca tat gca cat act agt tct gat att aga aca ctg gcc 944 Lys Leu Val Ser Tyr Ala His Thr Ser Ser Asp Ile Arg Thr Leu Ala 245 250 255 aaa tct ggc tat aag ggg gat gcg cac ccc aat tca acc att gtg agt 992 Lys Ser Gly Tyr Lys Gly Asp Ala His Pro Asn Ser Thr Ile Val Ser 260 265 270 tgc tca tat gat gtt agc ttg aag agt ttg gca tac ttc atg gtg gcg 1040 Cys Ser Tyr Asp Val Ser Leu Lys Ser Leu Ala Tyr Phe Met Val Ala 275 280 285 290 ccg aca tta tgt tac cag cct agc tat cct cgt tcg tcg tgt atc cgc 1088 Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Ser Cys Ile Arg 295 300 305 aag ggt tgg gtt gtt cgt caa ttt gtc aaa cta ata gtt ttc ata gga 1136 Lys Gly Trp Val Val Arg Gln Phe Val Lys Leu Ile Val Phe Ile Gly 310 315 320 ctc atg ggg ttc att ata gaa caa tat att aat cct atc gtt cga aat 1184 Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val Arg Asn 325 330 335 tcc aaa cac cca ttg aaa gga gat ttt tta tat gca ata gaa aga gtt 1232 Ser Lys His Pro Leu Lys Gly Asp Phe Leu Tyr Ala Ile Glu Arg Val 340 345 350 ttg aag ctt tca gtt cca aat cta tat gtt tgg ctt tgc atg ttc tac 1280 Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr 355 360 365 370 tct ttt ttc cac ctc tgg ttg aac ata ctg gct gag ctt ctt cgc ttt 1328 Ser Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu Arg Phe 375 380 385 ggt gat cgt gaa ttc tac aaa gat tgg tgg aat gca aaa act gtt gcg 1376 Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Thr Val Ala 390 395 400 gag tat tgg aaa atg tgg aat atg cct gtt cat aga tgg atg gtt cgt 1424 Glu Tyr Trp Lys Met Trp Asn Met Pro Val His Arg Trp Met Val Arg 405 410 415 cat cta tat ttt ccc tgt ttg agg aat ggg ata ccc aag gaa ggt gcc 1472 His Leu Tyr Phe Pro Cys Leu Arg Asn Gly Ile Pro Lys Glu Gly Ala 420 425 430 att att atc gcg ttc tta gtt tct ggt gct ttc cat gag ctc tgc att 1520 Ile Ile Ile Ala Phe Leu Val Ser Gly Ala Phe His Glu Leu Cys Ile 435 440 445 450 gca gtt cct tgc cac gta ttc aag tta tgg gcc ttt ata ggc att atg 1568 Ala Val Pro Cys His Val Phe Lys Leu Trp Ala Phe Ile Gly Ile Met 455 460 465 ttt cag gtt ccc ttg gta ttg att acg aat tat cta caa gaa aag ttc 1616 Phe Gln Val Pro Leu Val Leu Ile Thr Asn Tyr Leu Gln Glu Lys Phe 470 475 480 agt aat tct atg gtg ggc aat atg atc ttc tgg ttc atc ttc tgc ata 1664 Ser Asn Ser Met Val Gly Asn Met Ile Phe Trp Phe Ile Phe Cys Ile 485 490 495 ctt ggc caa cct atg tgt gtc ctt cta tat tac cat gac ctg ata aat 1712 Leu Gly Gln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu Ile Asn 500 505 510 cta aag gaa aag tga aaaaatggaa gttgcctatg ctcagagtat tcctatccca 1767 Leu Lys Glu Lys * 515 atgcacacat tatatggttc tgtacaatct gtgccccctt catcctttac acgtacccat 1827 gctggttcct gcacgatgat ttgccttttg tttgtaagca atatttggag agagtccaat 1887 ttaggaagtg actagtgtgg cttatatctt gtatactacc tttagtcatg ggggggtttt 1947 tatattacta gtaccaaaag tcaagttgta tatgatttac ggtttagttt ctttcatgtt 2007 ttttgttttt gtgtaaatat acgtttcata tatcactgtt ttttcaaagt aaaatcaata 2067 ataccccata gatgttgaaa ctg 2090 <210> SEQ ID NO 11 <211> LENGTH: 518 <212> TYPE: PRT <213> ORGANISM: Tropaeolum majus <400> SEQUENCE: 11 Met Ala Val Ala Glu Ser Ser Gln Asn Thr Thr Thr Met Ser Gly His 1 5 10 15 Gly Asp Ser Asp Leu Asn Asn Phe Arg Arg Arg Lys Pro Ser Ser Ser 20 25 30 Val Ile Glu Pro Ser Ser Ser Gly Phe Thr Ser Thr Asn Gly Val Pro 35 40 45 Ala Thr Gly His Val Ala Glu Asn Arg Asp Gln Asp Arg Val Gly Ala 50 55 60 Met Glu Asn Ala Thr Gly Ser Val Asn Leu Ile Gly Asn Gly Gly Gly 65 70 75 80 Val Val Ile Gly Asn Glu Glu Lys Gln Val Gly Glu Thr Asp Ile Arg 85 90 95 Phe Thr Tyr Arg Pro Ser Phe Pro Ala His Arg Arg Val Arg Glu Ser 100 105 110 Pro Leu Ser Ser Asp Ala Ile Phe Lys Gln Ser His Ala Gly Leu Phe 115 120 125 Asn Leu Cys Ile Val Val Leu Ile Ala Val Asn Ser Arg Leu Ile Ile 130 135 140 Glu Asn Leu Met Lys Tyr Gly Trp Leu Ile Asp Thr Gly Phe Trp Phe 145 150 155 160 Asn Ser Arg Ser Leu Gly Asp Trp Ser Ile Phe Met Cys Cys Leu Thr 165 170 175 Leu Pro Ile Phe Pro Leu Ala Ala Phe Ile Val Glu Lys Leu Val Gln 180 185 190 Arg Asn His Ile Ser Glu Leu Val Ala Val Leu Leu His Val Ile Val 195 200 205 Ser Thr Ala Ala Val Leu Tyr Pro Val Ile Val Ile Leu Thr Cys Asp 210 215 220 Ser Val Tyr Met Ser Gly Val Val Leu Met Leu Phe Gly Cys Ile Met 225 230 235 240 Trp Leu Lys Leu Val Ser Tyr Ala His Thr Ser Ser Asp Ile Arg Thr 245 250 255 Leu Ala Lys Ser Gly Tyr Lys Gly Asp Ala His Pro Asn Ser Thr Ile 260 265 270 Val Ser Cys Ser Tyr Asp Val Ser Leu Lys Ser Leu Ala Tyr Phe Met 275 280 285 Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Ser Ser Cys 290 295 300 Ile Arg Lys Gly Trp Val Val Arg Gln Phe Val Lys Leu Ile Val Phe 305 310 315 320 Ile Gly Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val 325 330 335 Arg Asn Ser Lys His Pro Leu Lys Gly Asp Phe Leu Tyr Ala Ile Glu 340 345 350 Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met 355 360 365 Phe Tyr Ser Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu 370 375 380 Arg Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Thr 385 390 395 400 Val Ala Glu Tyr Trp Lys Met Trp Asn Met Pro Val His Arg Trp Met 405 410 415 Val Arg His Leu Tyr Phe Pro Cys Leu Arg Asn Gly Ile Pro Lys Glu 420 425 430 Gly Ala Ile Ile Ile Ala Phe Leu Val Ser Gly Ala Phe His Glu Leu 435 440 445 Cys Ile Ala Val Pro Cys His Val Phe Lys Leu Trp Ala Phe Ile Gly 450 455 460 Ile Met Phe Gln Val Pro Leu Val Leu Ile Thr Asn Tyr Leu Gln Glu 465 470 475 480 Lys Phe Ser Asn Ser Met Val Gly Asn Met Ile Phe Trp Phe Ile Phe 485 490 495 Cys Ile Leu Gly Gln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu 500 505 510 Ile Asn Leu Lys Glu Lys 515 <210> SEQ ID NO 12 <211> LENGTH: 2099 <212> TYPE: DNA <213> ORGANISM: Nicotiana tabacum <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (208)...(1806) <400> SEQUENCE: 12 ggcacgagcg aaatcttacc caatcctccg ttgcttttct tttagatcct ctttttctgt 60 cattctcttt ttcccaataa caacaactca ttgcatgtga aggttgattt tgatttttgt 120 gtttattcaa actctctctt cacgattttc ttactctttc tagaagtatc cattactttt 180 tagtctgtga ttcggcgaaa gtaagca atg gtg atc atg gaa ttg ccg gag agc 234 Met Val Ile Met Glu Leu Pro Glu Ser 1 5 gtc gaa atg acg acg acg acg acg act tcg ggt att gag aac ctc aac 282 Val Glu Met Thr Thr Thr Thr Thr Thr Ser Gly Ile Glu Asn Leu Asn 10 15 20 25 tcc gat ctt aat cac tcg gtt cgg agg aga cgt ggc agt aat ggt ttt 330 Ser Asp Leu Asn His Ser Val Arg Arg Arg Arg Gly Ser Asn Gly Phe 30 35 40 gag gcg gct agt gca att aac agt tcg gat gcg aat atg agc gaa gat 378 Glu Ala Ala Ser Ala Ile Asn Ser Ser Asp Ala Asn Met Ser Glu Asp 45 50 55 aga aga gat gtg tgt ggc agc ggt gct gga ttg gaa acg gtg aat gag 426 Arg Arg Asp Val Cys Gly Ser Gly Ala Gly Leu Glu Thr Val Asn Glu 60 65 70 cgg agt aaa tcg gtt ggt gag tcc agt gat gta att cga aag gag gac 474 Arg Ser Lys Ser Val Gly Glu Ser Ser Asp Val Ile Arg Lys Glu Asp 75 80 85 gac agg aat gat aat gtt gcg aat ggt gag gaa agc aaa tca acg gaa 522 Asp Arg Asn Asp Asn Val Ala Asn Gly Glu Glu Ser Lys Ser Thr Glu 90 95 100 105 aca aca acg acg ccg ttt aaa ttt gct tac agg gcg tcg gca cca gct 570 Thr Thr Thr Thr Pro Phe Lys Phe Ala Tyr Arg Ala Ser Ala Pro Ala 110 115 120 cac cgg cga atc aag gag agt cct ctc agc tcc gac gcc att ttc aaa 618 His Arg Arg Ile Lys Glu Ser Pro Leu Ser Ser Asp Ala Ile Phe Lys 125 130 135 cag agt cac gca ggc ctg ttc aat ctc tgt gtg gtg gtg ctg att gct 666 Gln Ser His Ala Gly Leu Phe Asn Leu Cys Val Val Val Leu Ile Ala 140 145 150 gtt aac agc agg ctg att atc gag aac ttg atg aag tat ggc ctt tta 714 Val Asn Ser Arg Leu Ile Ile Glu Asn Leu Met Lys Tyr Gly Leu Leu 155 160 165 att agg gct ggc ttt tgg ttt agc tcg aag tcg ttg agg gat tgg ccg 762 Ile Arg Ala Gly Phe Trp Phe Ser Ser Lys Ser Leu Arg Asp Trp Pro 170 175 180 185 ctt cta atg tgc tgt ctc agt ctc caa att ttg ccg ctc gct gct ttt 810 Leu Leu Met Cys Cys Leu Ser Leu Gln Ile Leu Pro Leu Ala Ala Phe 190 195 200 ctt gtg gag aag ttg gca cag cag agg cat ttg act gag cgt gcg gtg 858 Leu Val Glu Lys Leu Ala Gln Gln Arg His Leu Thr Glu Arg Ala Val 205 210 215 gtt act ctt cac ata act ata aca aca gct gcc att ttg tat cca gtt 906 Val Thr Leu His Ile Thr Ile Thr Thr Ala Ala Ile Leu Tyr Pro Val 220 225 230 ctg gtc att ctt ggg tgt gat tct gct ttt ctg ttt ggt gtc ata ttg 954 Leu Val Ile Leu Gly Cys Asp Ser Ala Phe Leu Phe Gly Val Ile Leu 235 240 245 atg ctg gtt gct tgc att gtg tgg atg aag ctg gtt tct tac gca cat 1002 Met Leu Val Ala Cys Ile Val Trp Met Lys Leu Val Ser Tyr Ala His 250 255 260 265 aca aat cat gat atg aga cag ctc gca aag tct acg gac aag gat gaa 1050 Thr Asn His Asp Met Arg Gln Leu Ala Lys Ser Thr Asp Lys Asp Glu 270 275 280 act tca gat ggg gat ttc tct tat gat gtt agc ttc aag agt ttg gct 1098 Thr Ser Asp Gly Asp Phe Ser Tyr Asp Val Ser Phe Lys Ser Leu Ala 285 290 295 tac ttc atg gtt gcg cca aca tta tgt tat cag ctt agc tat ccc cac 1146 Tyr Phe Met Val Ala Pro Thr Leu Cys Tyr Gln Leu Ser Tyr Pro His 300 305 310 act cca tgc att cgc aaa ggt tgg gtg gca cgc caa ttc atc aag ctg 1194 Thr Pro Cys Ile Arg Lys Gly Trp Val Ala Arg Gln Phe Ile Lys Leu 315 320 325 gta ata ttt aca gga ttg atg gga ttt atc ata gaa cag tac att aac 1242 Val Ile Phe Thr Gly Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn 330 335 340 345 cca att gtg caa aac tca caa cat cct ttg aaa gga aac ctt tta tat 1290 Pro Ile Val Gln Asn Ser Gln His Pro Leu Lys Gly Asn Leu Leu Tyr 350 355 360 gcc atc gag agg gta ttg aag ctt tcg gtt cca aat tta tat gtc tgg 1338 Ala Ile Glu Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp 365 370 375 ctc tgc atg ttt tac tgc ttc ttt cat ctt tgg cta aat ata ctt gcg 1386 Leu Cys Met Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala 380 385 390 gaa cta cta tgt ttt ggt gat cgt gag ttc tac aag gat tgg tgg aat 1434 Glu Leu Leu Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn 395 400 405 gcc aaa aca att gat gag tac tgg agg atg tgg aat atg cct gtt cat 1482 Ala Lys Thr Ile Asp Glu Tyr Trp Arg Met Trp Asn Met Pro Val His 410 415 420 425 aag tgg atg gtt cgt cac att tat ttc cct tgc tta aga aac gga att 1530 Lys Trp Met Val Arg His Ile Tyr Phe Pro Cys Leu Arg Asn Gly Ile 430 435 440 cca aag ggg gtc gca ata ctg att gct ttc ctt gta tct gct gtt ttc 1578 Pro Lys Gly Val Ala Ile Leu Ile Ala Phe Leu Val Ser Ala Val Phe 445 450 455 cac gag ctg tgt att gct gtt cca tgt cgc ctt ttc aag tgg tgg gca 1626 His Glu Leu Cys Ile Ala Val Pro Cys Arg Leu Phe Lys Trp Trp Ala 460 465 470 ttc atg gga att atg ttc cag gtt cct ttg gtc ata ctc aca aac ttc 1674 Phe Met Gly Ile Met Phe Gln Val Pro Leu Val Ile Leu Thr Asn Phe 475 480 485 tta caa aac aag ttc caa agc tcg atg gtg ggc aat atg atg ttc tgg 1722 Leu Gln Asn Lys Phe Gln Ser Ser Met Val Gly Asn Met Met Phe Trp 490 495 500 505 tgc ttt ttc tgc att ctt ggt cag cca atg tgt gtg ctt ctg tat tac 1770 Cys Phe Phe Cys Ile Leu Gly Gln Pro Met Cys Val Leu Leu Tyr Tyr 510 515 520 cac gat gtg atg aat aga aaa agc agt gca cgt taa gcttcatcca 1816 His Asp Val Met Asn Arg Lys Ser Ser Ala Arg * 525 530 gggatgaatt gttgtatgag caagtatttt aagttttgga tcccaagctc tattctactg 1876 tttctggcaa ggcattcctg ctatttcctt catcagttcc aacaatattc agatgatacg 1936 aaatatctgt ttggaatgca caacacaagc cacggccaga gatgctgatg tctcacattt 1996 tattgtgttc ttcatgtcgg agaaatgtaa aatactatct tgagataact ctcatgttag 2056 taaatacctt tttgcctcta aaaaaaaaaa aaaaaaaaaa aaa 2099 <210> SEQ ID NO 13 <211> LENGTH: 532 <212> TYPE: PRT <213> ORGANISM: Nicotiana tabacum <400> SEQUENCE: 13 Met Val Ile Met Glu Leu Pro Glu Ser Val Glu Met Thr Thr Thr Thr 1 5 10 15 Thr Thr Ser Gly Ile Glu Asn Leu Asn Ser Asp Leu Asn His Ser Val 20 25 30 Arg Arg Arg Arg Gly Ser Asn Gly Phe Glu Ala Ala Ser Ala Ile Asn 35 40 45 Ser Ser Asp Ala Asn Met Ser Glu Asp Arg Arg Asp Val Cys Gly Ser 50 55 60 Gly Ala Gly Leu Glu Thr Val Asn Glu Arg Ser Lys Ser Val Gly Glu 65 70 75 80 Ser Ser Asp Val Ile Arg Lys Glu Asp Asp Arg Asn Asp Asn Val Ala 85 90 95 Asn Gly Glu Glu Ser Lys Ser Thr Glu Thr Thr Thr Thr Pro Phe Lys 100 105 110 Phe Ala Tyr Arg Ala Ser Ala Pro Ala His Arg Arg Ile Lys Glu Ser 115 120 125 Pro Leu Ser Ser Asp Ala Ile Phe Lys Gln Ser His Ala Gly Leu Phe 130 135 140 Asn Leu Cys Val Val Val Leu Ile Ala Val Asn Ser Arg Leu Ile Ile 145 150 155 160 Glu Asn Leu Met Lys Tyr Gly Leu Leu Ile Arg Ala Gly Phe Trp Phe 165 170 175 Ser Ser Lys Ser Leu Arg Asp Trp Pro Leu Leu Met Cys Cys Leu Ser 180 185 190 Leu Gln Ile Leu Pro Leu Ala Ala Phe Leu Val Glu Lys Leu Ala Gln 195 200 205 Gln Arg His Leu Thr Glu Arg Ala Val Val Thr Leu His Ile Thr Ile 210 215 220 Thr Thr Ala Ala Ile Leu Tyr Pro Val Leu Val Ile Leu Gly Cys Asp 225 230 235 240 Ser Ala Phe Leu Phe Gly Val Ile Leu Met Leu Val Ala Cys Ile Val 245 250 255 Trp Met Lys Leu Val Ser Tyr Ala His Thr Asn His Asp Met Arg Gln 260 265 270 Leu Ala Lys Ser Thr Asp Lys Asp Glu Thr Ser Asp Gly Asp Phe Ser 275 280 285 Tyr Asp Val Ser Phe Lys Ser Leu Ala Tyr Phe Met Val Ala Pro Thr 290 295 300 Leu Cys Tyr Gln Leu Ser Tyr Pro His Thr Pro Cys Ile Arg Lys Gly 305 310 315 320 Trp Val Ala Arg Gln Phe Ile Lys Leu Val Ile Phe Thr Gly Leu Met 325 330 335 Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val Gln Asn Ser Gln 340 345 350 His Pro Leu Lys Gly Asn Leu Leu Tyr Ala Ile Glu Arg Val Leu Lys 355 360 365 Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr Cys Phe 370 375 380 Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu Cys Phe Gly Asp 385 390 395 400 Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Lys Thr Ile Asp Glu Tyr 405 410 415 Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg His Ile 420 425 430 Tyr Phe Pro Cys Leu Arg Asn Gly Ile Pro Lys Gly Val Ala Ile Leu 435 440 445 Ile Ala Phe Leu Val Ser Ala Val Phe His Glu Leu Cys Ile Ala Val 450 455 460 Pro Cys Arg Leu Phe Lys Trp Trp Ala Phe Met Gly Ile Met Phe Gln 465 470 475 480 Val Pro Leu Val Ile Leu Thr Asn Phe Leu Gln Asn Lys Phe Gln Ser 485 490 495 Ser Met Val Gly Asn Met Met Phe Trp Cys Phe Phe Cys Ile Leu Gly 500 505 510 Gln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Val Met Asn Arg Lys 515 520 525 Ser Ser Ala Arg 530 <210> SEQ ID NO 14 <211> LENGTH: 1964 <212> TYPE: DNA <213> ORGANISM: Piralla frutescens <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (69)...(1673) <400> SEQUENCE: 14 tttagaacca aactattctc cgttaagttc tgagttcgat ttctttcttt tctcaaattt 60 tccgtgcg atg gcg atc ttg gac tcg ccg gag atc ctg gac acg acg tcg 110 Met Ala Ile Leu Asp Ser Pro Glu Ile Leu Asp Thr Thr Ser 1 5 10 tcc agt gcc gac aac ggc gcc gca cat cac acc act ctt cgc cgg aga 158 Ser Ser Ala Asp Asn Gly Ala Ala His His Thr Thr Leu Arg Arg Arg 15 20 25 30 caa agt gcg cgc tcc gtt ccg cct ctt ctc gac tcc gat tcc aac tct 206 Gln Ser Ala Arg Ser Val Pro Pro Leu Leu Asp Ser Asp Ser Asn Ser 35 40 45 ctg gag gca gag agc gca atc aat gat tcc gaa aat gtt cga aac gac 254 Leu Glu Ala Glu Ser Ala Ile Asn Asp Ser Glu Asn Val Arg Asn Asp 50 55 60 gct aat ttg atc gaa aat ctc cgc ggc gga gcc gtg gaa tcc gag aac 302 Ala Asn Leu Ile Glu Asn Leu Arg Gly Gly Ala Val Glu Ser Glu Asn 65 70 75 gaa aaa cag gag agt tat ggt aag gag gag ggg gcg aaa gtg aag gag 350 Glu Lys Gln Glu Ser Tyr Gly Lys Glu Glu Gly Ala Lys Val Lys Glu 80 85 90 aat gga gaa act agt aat ggc aac gga act gat gtt atg gcc gtc aaa 398 Asn Gly Glu Thr Ser Asn Gly Asn Gly Thr Asp Val Met Ala Val Lys 95 100 105 110 ttc aca ttc agg ccg gcg gcg cct gct cac cgc aaa aat aag gag agt 446 Phe Thr Phe Arg Pro Ala Ala Pro Ala His Arg Lys Asn Lys Glu Ser 115 120 125 cct ctt agc tcc gac gcc atc ttc aaa cag agc cat gca ggc ctc ttc 494 Pro Leu Ser Ser Asp Ala Ile Phe Lys Gln Ser His Ala Gly Leu Phe 130 135 140 aac ctt tgt ata gtg gtg ctt gtt gct gta aat agc aga cta ata att 542 Asn Leu Cys Ile Val Val Leu Val Ala Val Asn Ser Arg Leu Ile Ile 145 150 155 gag aat tta atg aag tat ggg tgg ctg atc aaa tca gga ttt tgg ttt 590 Glu Asn Leu Met Lys Tyr Gly Trp Leu Ile Lys Ser Gly Phe Trp Phe 160 165 170 agt tca aca tcg ctt agg gat tgg cca ctg cta atg tgt tgt ctt agt 638 Ser Ser Thr Ser Leu Arg Asp Trp Pro Leu Leu Met Cys Cys Leu Ser 175 180 185 190 ctt cca gtt ttt gca ctc gct tca ttt ctt gtc gag aag ttg gtg aaa 686 Leu Pro Val Phe Ala Leu Ala Ser Phe Leu Val Glu Lys Leu Val Lys 195 200 205 cta aat tat ata cct gag tgg gtc gca gtc ttt ctt cat gtt aca atc 734 Leu Asn Tyr Ile Pro Glu Trp Val Ala Val Phe Leu His Val Thr Ile 210 215 220 aca aca gtg gaa atc ttg ttt cca gtt gtt gtc att ctt agg tgt gat 782 Thr Thr Val Glu Ile Leu Phe Pro Val Val Val Ile Leu Arg Cys Asp 225 230 235 tct gct gtt cta tca ggt gtc acg cta atg ctc ttt gct tgc act gta 830 Ser Ala Val Leu Ser Gly Val Thr Leu Met Leu Phe Ala Cys Thr Val 240 245 250 tgg ttg aag ctc gtt tcc tac gca cat aca aac tat gat ttg aga gta 878 Trp Leu Lys Leu Val Ser Tyr Ala His Thr Asn Tyr Asp Leu Arg Val 255 260 265 270 ctt gca aaa tca ctt gat aag tgg gaa gct atg tcc agg tac tgg aac 926 Leu Ala Lys Ser Leu Asp Lys Trp Glu Ala Met Ser Arg Tyr Trp Asn 275 280 285 ctc gac tac gct tat gat gta agc ttt aag agt ctg gca tac ttc atg 974 Leu Asp Tyr Ala Tyr Asp Val Ser Phe Lys Ser Leu Ala Tyr Phe Met 290 295 300 gtt gct cct aca ttg tgt tac cag cca agc tac cct cgg aca gct tgc 1022 Val Ala Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Thr Ala Cys 305 310 315 att cgg aag ggt tgg gtg gta agg caa cta att aag ctg gta ata ttc 1070 Ile Arg Lys Gly Trp Val Val Arg Gln Leu Ile Lys Leu Val Ile Phe 320 325 330 aca gga ctc atg gga ttt att ata gaa cag tac ata aac ccg atc gtt 1118 Thr Gly Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val 335 340 345 350 caa aat tct caa cat cct ctg aaa gga aac ctt tta tat gcc att gag 1166 Gln Asn Ser Gln His Pro Leu Lys Gly Asn Leu Leu Tyr Ala Ile Glu 355 360 365 agg gtc ttg aag ctt tct gtt cca aat tta tat gtg tgg ctc tgc atg 1214 Arg Val Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met 370 375 380 ttt tat tgt ttt ttc cac ctc tgg cta aat ata ctt gct gaa ctt ctg 1262 Phe Tyr Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu 385 390 395 tgc ttt ggg gac cgt gag ttt tat aag gat tgg tgg aat gcg agg aca 1310 Cys Phe Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Arg Thr 400 405 410 gtg gag gag tac tgg aga atg tgg aat atg cct gtc cat aaa tgg atg 1358 Val Glu Glu Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met 415 420 425 430 gtt cgg cat ata tac tgt cca tgc tta caa aat gga ata cca aag ata 1406 Val Arg His Ile Tyr Cys Pro Cys Leu Gln Asn Gly Ile Pro Lys Ile 435 440 445 gtg gca gtt ttg atc gcg ttt ctt gtg tct gcg att ttt cat gag ctg 1454 Val Ala Val Leu Ile Ala Phe Leu Val Ser Ala Ile Phe His Glu Leu 450 455 460 tgc gtt gca gtc cct tgc caa ata ttc aag ttt tgg gcg ttc tcg ggt 1502 Cys Val Ala Val Pro Cys Gln Ile Phe Lys Phe Trp Ala Phe Ser Gly 465 470 475 atc atg ctt cag gtt cct ctc gta atc gtg act aat tac ttg caa gaa 1550 Ile Met Leu Gln Val Pro Leu Val Ile Val Thr Asn Tyr Leu Gln Glu 480 485 490 aag ttc aaa aac tca atg gtg ggc aat atg atg ttc tgg tgc ttc ttc 1598 Lys Phe Lys Asn Ser Met Val Gly Asn Met Met Phe Trp Cys Phe Phe 495 500 505 510 tgt atc ttt ggt caa cct atg tgt gtg ttg ctg tac tac cac gac ttg 1646 Cys Ile Phe Gly Gln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu 515 520 525 atg aat cga aaa gct agt gca agg tag ggatgtgatt catcttctga 1693 Met Asn Arg Lys Ala Ser Ala Arg * 530 gtagaaatct aaagctcacc agccccaacc cacccgaaaa acaaaaagga gcaaggatcc 1753 tgattgtgag ctggtagata atttgctaca actatgtttc ttaaatagct gggagtagtt 1813 tgttatctgc cttcacctag gacgacgtta tgatctgttg tgatgggggt aagggggcat 1873 gcaaattttg tctatttttc aaggaataca gaaatggtga aaatttgatg aagcataccc 1933 ctcgtttact gacaaaaaaa aaaaaaaaaa a 1964 <210> SEQ ID NO 15 <211> LENGTH: 534 <212> TYPE: PRT <213> ORGANISM: Piralla frutescens <400> SEQUENCE: 15 Met Ala Ile Leu Asp Ser Pro Glu Ile Leu Asp Thr Thr Ser Ser Ser 1 5 10 15 Ala Asp Asn Gly Ala Ala His His Thr Thr Leu Arg Arg Arg Gln Ser 20 25 30 Ala Arg Ser Val Pro Pro Leu Leu Asp Ser Asp Ser Asn Ser Leu Glu 35 40 45 Ala Glu Ser Ala Ile Asn Asp Ser Glu Asn Val Arg Asn Asp Ala Asn 50 55 60 Leu Ile Glu Asn Leu Arg Gly Gly Ala Val Glu Ser Glu Asn Glu Lys 65 70 75 80 Gln Glu Ser Tyr Gly Lys Glu Glu Gly Ala Lys Val Lys Glu Asn Gly 85 90 95 Glu Thr Ser Asn Gly Asn Gly Thr Asp Val Met Ala Val Lys Phe Thr 100 105 110 Phe Arg Pro Ala Ala Pro Ala His Arg Lys Asn Lys Glu Ser Pro Leu 115 120 125 Ser Ser Asp Ala Ile Phe Lys Gln Ser His Ala Gly Leu Phe Asn Leu 130 135 140 Cys Ile Val Val Leu Val Ala Val Asn Ser Arg Leu Ile Ile Glu Asn 145 150 155 160 Leu Met Lys Tyr Gly Trp Leu Ile Lys Ser Gly Phe Trp Phe Ser Ser 165 170 175 Thr Ser Leu Arg Asp Trp Pro Leu Leu Met Cys Cys Leu Ser Leu Pro 180 185 190 Val Phe Ala Leu Ala Ser Phe Leu Val Glu Lys Leu Val Lys Leu Asn 195 200 205 Tyr Ile Pro Glu Trp Val Ala Val Phe Leu His Val Thr Ile Thr Thr 210 215 220 Val Glu Ile Leu Phe Pro Val Val Val Ile Leu Arg Cys Asp Ser Ala 225 230 235 240 Val Leu Ser Gly Val Thr Leu Met Leu Phe Ala Cys Thr Val Trp Leu 245 250 255 Lys Leu Val Ser Tyr Ala His Thr Asn Tyr Asp Leu Arg Val Leu Ala 260 265 270 Lys Ser Leu Asp Lys Trp Glu Ala Met Ser Arg Tyr Trp Asn Leu Asp 275 280 285 Tyr Ala Tyr Asp Val Ser Phe Lys Ser Leu Ala Tyr Phe Met Val Ala 290 295 300 Pro Thr Leu Cys Tyr Gln Pro Ser Tyr Pro Arg Thr Ala Cys Ile Arg 305 310 315 320 Lys Gly Trp Val Val Arg Gln Leu Ile Lys Leu Val Ile Phe Thr Gly 325 330 335 Leu Met Gly Phe Ile Ile Glu Gln Tyr Ile Asn Pro Ile Val Gln Asn 340 345 350 Ser Gln His Pro Leu Lys Gly Asn Leu Leu Tyr Ala Ile Glu Arg Val 355 360 365 Leu Lys Leu Ser Val Pro Asn Leu Tyr Val Trp Leu Cys Met Phe Tyr 370 375 380 Cys Phe Phe His Leu Trp Leu Asn Ile Leu Ala Glu Leu Leu Cys Phe 385 390 395 400 Gly Asp Arg Glu Phe Tyr Lys Asp Trp Trp Asn Ala Arg Thr Val Glu 405 410 415 Glu Tyr Trp Arg Met Trp Asn Met Pro Val His Lys Trp Met Val Arg 420 425 430 His Ile Tyr Cys Pro Cys Leu Gln Asn Gly Ile Pro Lys Ile Val Ala 435 440 445 Val Leu Ile Ala Phe Leu Val Ser Ala Ile Phe His Glu Leu Cys Val 450 455 460 Ala Val Pro Cys Gln Ile Phe Lys Phe Trp Ala Phe Ser Gly Ile Met 465 470 475 480 Leu Gln Val Pro Leu Val Ile Val Thr Asn Tyr Leu Gln Glu Lys Phe 485 490 495 Lys Asn Ser Met Val Gly Asn Met Met Phe Trp Cys Phe Phe Cys Ile 500 505 510 Phe Gly Gln Pro Met Cys Val Leu Leu Tyr Tyr His Asp Leu Met Asn 515 520 525 Arg Lys Ala Ser Ala Arg 530 <210> SEQ ID NO 16 <211> LENGTH: 1181 <212> TYPE: DNA <213> ORGANISM: Zea mays <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: 235, 236, 237, 238, 239, 317, 318, 319, 320, 321, 322, 393, 394, 395, 396, 397, 398 <223> OTHER INFORMATION: n = A,T,C or G <400> SEQUENCE: 16 ccacgcgtcc ggtctcttat gcacatacaa attatgatat aagggtattg tccaaaagta 60 ctgagaaggg tgctgcatat ggaaattatg tcgatcctga gaatatgaaa gatccaacct 120 ttaaaagtct agtgtacttc atgttggccc caacactttg ttaccagcca acttatcctc 180 aaactacatg tattagaaag ggttgggtga cccagcaact cataaagtgc gtggnnnnna 240 caggcttgat gggcttcata attgagcaat atataaaccc aattgtgaag aattccaaac 300 atccactgaa agggaannnn nngaatgcta tagaaagagt cttaaaactc tcagtgccaa 360 cattatatgt atggctttgc atgttctatt gcnnnnnnca tttatggctg aacattgtag 420 ctgaactcct ctgtttcggt gaccgtgaat tctataagga ctggtggaat gccaaaactg 480 ttgaagagta ctggaggatg tggaacatgc ctgttcataa gtggatcatc agacacatat 540 attttccatg tataaggaaa ggcttttcca ggggtgtagc tattctaatc tcgtttctgg 600 tttcagctgt attccatgag atatgtattg cggtgccgtg ccacattttc aaattctggg 660 cattttctgg gatcatgttt cagataccgt tggtattctt gacaagatat ctccatgcta 720 cgttcaagca tgtaatggtg ggcaacatga tattttggtt cttcagtata gtcggacagc 780 cgatgtgtgt ccttctatac taccatgacg tcatgaacag gcaggcccag gcaagtagat 840 agttcggcag agacatgtac ttcaacatcg accatcagaa gcagactgga gcgacgcggc 900 aggaagcagc agtagcccct cacttgccat tgtttaccat cagctagcag ctggtcagcc 960 tgaaaatgct tctattccag gcggccgtgc tgggatgcag cgtgtacatt accagcgatg 1020 caacgtcaac ggttaccaag cgcggataag cagtgctcgt ggcatgtgaa ctctgtacac 1080 attgaagtct atatgtccgc ggtgctgtgt gcataatgtc gtggtagatg taatatgtac 1140 aatgtatgcc acgacttaag aggtcaaaat gaggctcacg t 1181 <210> SEQ ID NO 17 <211> LENGTH: 1572 <212> TYPE: DNA <213> ORGANISM: Zea mays <400> SEQUENCE: 17 ttttggttta atgctacatc attgcgagac tggccactgc taatgtgttg ccttagtcta 60 cccatatttc cccttggtgc atttgcagtc gaaaagttgg cattcaacaa tctcgttagt 120 gatcctgcta ctacctgttt tcacatcctt tttacaacat ttgaaattgt atatccagtg 180 ctcgtgattc ttaagtgtga ttctgcagtt ttatcaggct ttgtgttgat gtttattgcc 240 tgcattgttt ggctgaagct tgtatctttt gcacatacaa accatgatat aagaaaactg 300 atcacaagcg gcaagaaggt tgataatgaa ctgaccgcgg ctggcataga taatttacaa 360 gctccaactc ttgggagtct aacatacttc atgatggctc cgacactctg ttatcagcca 420 agttatcctc gaacacctta tgttagaaaa ggttggctgg tccgtcaagt tattctatac 480 ttgatattta ctggtctcca aggattcatt attgagcaat acataaatcc tattgttgtg 540 aactctcaac atccattgat gggaggatta ctgaatgctg tagagactgt tttgaagctc 600 tcattaccaa atgtctacct gtggctttgc atgttttatt gccttttcca tctgtggtta 660 aacatacttg ctgagattct tcgatttggt gaccgagaat tctacaaaga ctggtggaat 720 gcaaagacaa ttgatgagta ctggagaaaa tggaacatgc ctgtgcataa atggattgtt 780 cgtcatatat attttccttg catgcgaaat ggtatatcaa aggaagttgc tgtttttata 840 tcgttctttg tttctgctgt acttcatgag gtaacttatt tactttttca ctcttcatct 900 gcatatatta attatatagt tctctatttt caaatgtgtc ctttcgagtt tcgacatgct 960 tttgttcaaa cttaccagct gtagattact tggatgaagt gctctatata aaattcaata 1020 tttcacaatc cagtcccttt cgagaaaatt atgatacatt ttgtttgcat ttgtacacca 1080 gttatgcgtt gcagttccct gccacatact caagttctgg gctttcttag gaatcatgct 1140 tcagattccc ctcatcatat tgacatcata cctcaaaaat aaattcagtg acacaatggt 1200 tggcaatatg atcttttggt tttttttctg catatacggg cagccaatgt gtgttctatt 1260 gtattaccat gatgtgatga accggactga gaaggcaaaa taaccatctg tagatctttt 1320 ttggtgtttc atttctgcca tcatggaaac tgaaagcaat aatctgtgca cacagtaaac 1380 cagcatcgtg tcttccagtt ttctttttgt tgttggaatc tatcctagat ctttatcatg 1440 tgtatggtgg ataacctcat gtcaccatcg tatctgtata caataagcct aaatcagctg 1500 acgttatata tgtaaattag taaatgtaat gactaattag tgccaagaaa aaaaaaaaaa 1560 aaaaaaaaaa aa 1572 

What is claimed is:
 1. A plant polynucleotide present in other than its naturally occurring environment encoding a product having DGAT activity.
 2. The polynucleotide of claim 1, wherein said polynucleotide comprises a region of at least about 50 nucleotides in length that has at least about 75% sequence identity to a region of SEQ ID NO:
 1. 3. The polynucleotide of claim 2, wherein said polynucleotide comprises a region of at least about 50 nucleotides in length that has at least about 90% sequence identity to a region of SEQ ID NO:
 1. 4. The polynucleotide of claim 3, wherein said polynucleotide comprises a region of at least about 50 nucleotides in length that has at least about 95% sequence identity to a region of SEQ ID NO:
 1. 5. The polynucleotide of claim 4, wherein said polynucleotide comprises a region of at least about 50 nucleotides in length that is identical to a region of SEQ ID NO:
 1. 6. The isolated polynucleotide of claim 1, wherein said polynucleotide is a monocot polynucleotide
 7. The isolated polynucleotide of claim 1, wherein said polynucleotide is a dicot polynucleotide
 8. The isolated polynucleotide of claim 1, wherein said polynucleotide is a soybean, maize, or canola polynucleotide.
 9. A nucleic acid that hybridizes under stringent conditions to the nucleic acid of SEQ ID NO: 1, wherein said nucleic acid encodes a DGAT polypeptide or is the complement of a nucleic acid that encodes a DGAT polypeptide.
 10. A composition comprising a plant DGAT polypeptide present in other than its naturally occurring environment, wherein said composition exhibits a DGAT activity of at least about 100 pmol triglycerides formed/mg protein/min.
 11. The DGAT polypeptide composition according to claim 10, wherein said polypeptide has an amino acid sequence of a naturally occurring DGAT polypeptide.
 12. A fragment of a DGAT polypeptide.
 13. The DGAT polypeptide fragment according to claim 12, wherein said polypeptide fragment is encoded by a polynucleotide comprising the sequence of SEQ ID NO:1.
 14. An antibody binding specifically to a plant polypeptide having DGAT activity.
 15. An expression cassette comprising the polynucleotide of claim
 1. 16. A cell comprising an expression cassette of claim
 15. 17. The cell according to claim 16, wherein said expression cassette is part of an extrachromosomal region or integrated into a chromosome of said cell.
 18. The cell according to claim 16, wherein said cell is a plant cell.
 19. A method of producing a polypeptide having plant DGAT activity, said method comprising: a) growing a cell according to claim 16 under conditions sufficient to express said polypeptide; and b) harvesting said polypeptide.
 20. A transgenic plant comprising an expression cassette of claim
 15. 21. The transgenic plant of claim 20, wherein at least one tissue of said plant is altered in DGAT activity as compared to wild type.
 22. The transgenic plant according to claim 20, wherein said plant is modified in at least one DGAT trait.
 23. The transgenic plant according to claim 22, wherein said DGAT trait is chosen from TAG content, TAG composition, DGAT protein content and total protein content.
 24. The transgenic plant according to claim 22, wherein said trait is a seed trait.
 25. The transgenic plant according to claim 24, wherein said seed trait is an increase in TAG content in seeds.
 26. The transgenic plant according to claim 24, wherein said seed trait is an increase in total protein content in seeds.
 27. Seeds of the transgenic plant according to claim
 24. 28. A method to identify plant DGAT-encoding polynucleotides comprising: employing a probe comprising a sequence substantially similar or identical to SEQ ID NO:1.
 29. The method according to claim 28, wherein said probe is a nucleic acid molecule.
 30. The method according to claim 29, wherein said probe is hybridized against a library of plant-derived nucleic acid fragments under stringent conditions.
 31. The method according to claim 28, wherein said probe is a virtual probe.
 32. The method according to claim 31, wherein said method comprises employing a nucleic acid sequence comparison program and a database comprising plant polynucleotide sequences.
 33. The method according to claim 32, wherein said method employs a BLAST algorithm.
 34. The method according to claim 33, wherein said BLAST algorithm is chosen from BLASTN and TBLASTX.
 35. A method of producing a plant having a modified DGAT activity, the method comprising: introducing an isolated polynucleotide of claim 1 into a cell of a plant to produce said plant.
 36. The method according to claim 35, wherein said polynucleotide is an inhibitory polynucleotide.
 37. The method according to claim 35, wherein said plant is modified in a DGAT trait.
 38. The method according to claim 37, wherein said DGAT trait is chosen from TAG content, TAG composition, DGAT protein content and total protein content
 39. The method according to claim 37, wherein said trait is a seed trait.
 40. A plant produced by the method of claim
 35. 41. Seeds produced by the plant of claim
 40. 42. In a method of producing oil from seeds, the improvement comprising: producing oil from said seeds of claim
 41. 43. A method of producing a triacylglycerol, said method comprising: contacting a diacylglycerol and fatty acid CoA with the plant DGAT polypeptide of claim 10 under conditions sufficient for said triacylglycerol to be produced. 